Introduction 



These notes are intended for readers who have already had a solid course in 
contemporary symbolic logic. I assume that readers know how to interpret (informally) 
the symbols of symbolic logic and have learned how to do proofs in a natural deduction 
system such as that found in Jon Barwise and John Etchemendy's textbook, Language, 
Proof and Logic. The course picks up at the point where students need to learn a precise 
definition of truth in a structure and learn to use it to demonstrate the validity and 
invalidity of arguments. 

At the University of Cincinnati, the Philosophy Department teaches a two-quarter logic 
course. (Quarters are 11-week terms.) In the first eight weeks of the first quarter, the 
students study material in the first 13 chapters of Barwise and Etchemendy's 
textbook. (We skip some sections.) These notes pertain to the last two weeks of the first 
quarter and all of the second quarter. 

If you, dear reader, discover any errors, typographical or worse, I would be grateful for 
your feedback. Contact me at: christopher.gauker@uc.edu 

Outline of course 

Here, in outline, is what we will do in this course: 

I. Define first-order validity precisely. To do that we need definitions of structures, 
variable assignments, satisfaction by a variable assignment in a structure, and truth 
in a structure. 

II. Prove soundness and completeness. That is, we prove that the class of first-order 
valid arguments exactly coincides with the class of arguments whose conclusions 
can be derived from their premises using our inference rules. 

Dividends that we will reap from this include: 

A. The Löwenheim-Skolem Theorem. If a theory (a set of sentences) has a model 
(a structure in which all of the sentences in the set are true), then it has a model 
with a denumerable domain. (So even a theory of the real numbers will be 
interpretable as true in a model whose domain contains just the positive 
integers.) 
B. The Compactness Theorem. If every finite subset of a set of sentences is 
consistent, then the whole set is consistent. This does not look very exciting, 
but it will be exciting to discover, later on, that second-order logic is not 
compact. 

III. Prove Gödel's First Incompleteness Theorem. What this says is that the truths of 
arithmetic are not axiomatizable. That is, there is no decidable, consistent set of 
sentences such that all of the truths of arithmetic are first-order consequences of the 
sentences in that set. Not even if the set of axioms is infinite (so long as it is 
decidable). 

Some dividends that we will reap along the way include: 

A. We will acquire the concepts of decidability and enumerability and will learn 
how they can be defined in a precise way in terms of the formulas of arithmetic. 

B. We will prove Tarski's undefinability theorem, which says that no bivalent 
language can contain its own truth predicate. 

Actually, we will prove the Gödel theorem in several different ways and in two 
importantly different versions. 

IV. Undecidability of first-order logic. First-order logic is axiomatizable. (For 
instance, our Fitch inference rules constitute such an axiomatization.) However, it 
is not decidable. That is, there is no algorithm that will take any argument and tell 
us whether or not that argument is first-order valid. This will follow pretty quickly 
from some results that we will have proved on the way to the second version of 
Gödel's First Incompleteness Theorem. 

V. Gödel's Second Incompleteness Theorem. We will make short shrift of this by 
starting with some big assumptions that everyone believes but no one bothers to 
prove. 

VI. Second-order logic. Second-order logic is just like first-order logic, except that we 
will have variables that stand in predicate position and we will have quantifiers that 
bind those variables. The important fact about second-order logic that we will learn 
is that, like arithmetic, it is not axiomatizable. We can define logical validity for 
second-order logic, but then we cannot have a set of axioms and inference rules that 
allow us to give proofs for all and only the second-order valid arguments. 
VII. Modal logic. We will define logical validity for a language containing modal 
operators, such as "necessarily" and "possibly". This gets a bit tricky and 
controversial when we try to add quantifiers as well. This last unit will belong to 
what is called "philosophical logic", whereas everything prior will belong to 
(elementary) mathematical logic. 

Sources 

Authors of logic books are not very good about telling us where they learned what they 
know. Maybe they would like us to believe that they made it up themselves! But I 
certainly did not make this stuff up myself; so here I will list the books from which I have 
learned the things that I explain in these notes: 

Jon Barwise and John Etchemendy, Language, Proof and Logic, CSLI (1999-2002). I 
have taught elementary logic from countless textbooks over the years; this is clearly the 
best. The computer programs that come with the textbook are an excellent teaching tool. 
My proof of the completeness theorem (i.e., the proof of the completeness of the Fitch- 
style natural deduction system with respect to the definition of logical validity) comes 
from chapters 17 and 19 of this book. However, I have filled in many details that they 
omit. Strangely, they do not prove a soundness theorem at all (contenting themselves 
with only the most hand-waving sketch); so I have had to construct that from scratch. 
With axiomatic proof theories, the proof of soundness is quite trivial; it's not quite so 
trivial for natural deduction systems. 

Raymond Smullyan, Gödel's Incompleteness Proofs, Oxford University Press (1992). 
This is a brilliant book. Although I had previously taught the proofs of Gödel's theorems 
in Enderton and Boolos & Jeffrey, I probably never really understood Gödel's theorems 
until I read this book. Smullyan avoids a great deal of complexity by defining recursively 
enumerable sets and relations as Σ1 sets and relations. (I will explain these concepts 
below.) The proofs of incompleteness that I have given below follow certain threads that 
I have picked out of this book. In one place I refer the reader to this book for a detail that 
I do not myself provide (but which the reader will probably be content to grant without 
proof). Strangely, Smullyan goes almost right up to, but then does not actually prove, the 
very general version of Gödel's Theorem proved in Enderton and Boolos & Jeffrey, 
which I prove in Lesson 10 below. I don't understand why Oxford cannot bring out an 
inexpensive paperback edition of this book. Maybe it's not Smullyan's fault, but 
somebody did a very bad job with the index and references. 
George Boolos and Richard Jeffrey, Computability and Logic, 2nd edition (or 3rd 
edition), Cambridge University Press. My proof of the more general version of Gödel's 
Theorem borrows some final steps from this book. Also, my proof of the undecidability 
of first-order logic is based on the one in this book. This book is widely used in 
Philosophy, but it's a little strange since it starts out with chapters on computability and 
then does everything else in terms of that. There is a fourth edition, which adds another 
author, John Burgess. That one looks significantly different (the type has been 
completely re-set), but I have never read it. 

Herbert Enderton, Introduction to Mathematical Logic, Academic Press (1972). This was 
once the standard textbook, and maybe it still is if Boolos and Jeffrey has not eclipsed it. 
The proofs are sometimes sketchy, and sometimes key steps do not stand out clearly. 
The proof of the representability (in a subtheory of number theory) of every recursive 
function is very hard to penetrate. In these notes I have relied on this text only in the 
presentation of second order logic. 

G. E. Hughes and M. J. Cresswell, A New Introduction to Modal Logic, Routledge 
(1996). I can't say that I learned what I know about modal logic from this book, since I 
did not even read it until 2004. Nonetheless, this book does contain all of the essential 
information (as well as much more than the essentials). 

Stewart Shapiro, Foundations without Foundationalism: A Case for Second-order Logic, 
Oxford (1991). 



Lesson 1: First-order Validity 



Throughout, I will assume that we are dealing with a particular first-order language L. I 
will assume that the reader knows the usual formation rules for a first-order language. 

Recall the definition of tautological validity: 

An argument is tautologically valid if and only if for each assignment of truth values 
to the noncompound components, if the premises are true on that assignment, then the 
conclusion is true on that assignment too. 

The definition of first-order validity will look similar: 

An argument is first-order valid if and only if for each structure of the language, if the 
premises are true in that structure, then the conclusion is true in that structure too. 

So, to understand this definition, we need to know: What is a structure? 

Very approximately: A structure is a thing that will assign a definite "meaning" or, more 
accurately, "reference", to each basic vocabulary item, that is, to each name and each 
predicate. Here, for example, are two partial specifications of structures. 

Structure One: 

Interprets "a" as standing for a big tetrahedron in the lower left-hand corner of a certain 
grid. 

Interprets "Cube" as standing for the set of cubes on the grid. 

Interprets "Larger" as standing for the set of pairs such that the first member is larger than 
the second. 

Structure Two: 

Interprets "a" as standing for a small dodecahedron in the upper right-hand corner. 

Interprets "Cube" as standing for the dodecs on the left-hand side and the large blocks 
on the right-hand side. 

Interprets "Larger" as standing for the set of pairs such that the first member is a 
tetrahedron and the second member is a cube. 
Why this will be a little bit complicated 

Now we want to explain how the truth value of a sentence of a first-order language 
depends on the interpretation of its components. In propositional logic, this is 
straightforward because the truth value of a compound sentence is determined by the 
truth values of its components. 

Example: If "Cube(a)" is False, and "Large(a)" is True, then we know that the 
truth value of "Cube(a) → Large(a)" is True. 

In first-order logic (quantifier logic), the truth value of a complex formula is not 
determined by the truth values of its components, because the components are not always 
sentences. 

Example: "∀x∃yLikes(x, y)" has "Likes(x, y)" as a component, but this is neither 
true nor false, because it is not even a sentence. 

The solution will be to proceed in three steps: 

In association with a structure, we will assign "temporary" meanings to variables, via 
variable assignments. 

We will first define "Variable assignment g satisfies formula P in structure M". 

Then, in terms of satisfaction by a variable assignment we will define "Sentence P is true 
in M". 

Notational conventions: 

When I want to talk about an actual formula or vocabulary item of the language L, I will 
put it in quotation marks: 

"Cube(a)" 

"∃xLarger(y, x)" 

But sometimes (even very soon) I will omit the quotation marks just to avoid clutter. 
When I want to talk about all vocabulary items of a certain kind, or all formulas of a 
certain form, I will use bold-face sans serif: 

If n is a name of L, then .... 

For any formula of the form (P ∨ Q), if either ... 

Note especially: Some formulas of the form ∃vP: 

∃xCube(x) 
∃y(Cube(y) ∧ ¬Small(y)) 
∃x(Cube(x) ∧ ∀y(x ≠ y → Larger(x, y))) 
∃x∃yTet(y) 

Similarly: Some formulas of the form ∀vQ: 

∀xCube(x) 
∀x¬∃yLarger(x, y) 
∀y(Cube(y) → ∃x(Tet(x) ∧ Adjoins(y, x))) 

Note, though, that in subsequent lessons, I will cease to use boldface and just leave it to 
context to determine whether I am talking about a sentence of L or am using schematic 
letters. 

Sets 

I will assume that the reader has an intuitive understanding of the concept of a set and 
understands that the membership of a finite set can be specified by writing names of the 
members of the set between curly brackets. Also, the order in which we list the members 
does not affect the identity of the set. For example: 

The set consisting of UC philosophy professors = 

{Gauker, Martin, Robinson, Polger, Skipper, Faaborg, Jost, Richardson, Maglo, 
Carbonell, Allen} 

= 

{Robinson, Gauker, Martin, Carbonell, Polger, Skipper, Allen, Faaborg, Jost, 
Richardson, Maglo} 
Set membership: 

Set membership is indicated with the symbol "∈" (a stylized epsilon). 

Gauker ∈ {Gauker, Martin, Robinson, Polger, Skipper, Faaborg, Jost, 
Richardson, Maglo, Carbonell, Allen}. 

Obama ∉ {Gauker, Martin, Robinson, Polger, Skipper, Faaborg, Jost, 
Richardson, Maglo, Carbonell, Allen}. 

The empty set: 
∅ = {} 

Do not confuse this with the set containing the empty set: {∅} 

Unions: "A ∪ B" stands for the union of sets A and B. 

For example, {1, 5, 9} ∪ {3, 9, 12} = {1, 3, 5, 9, 12}. 

⋃_{i=1}^{∞} A_i is the union of the infinite series of sets A_1, A_2, A_3, ... 

Subset: "A ⊆ B" means that A is a subset of B. 

{1, 5} ⊆ {1, 2, 5, 7, 9}. 
{1, 5} ⊆ {1, 5}. 

n-tuples 

A pair (or 2-tuple): ⟨Gauker, Martin⟩ 

A triple (or 3-tuple): ⟨Gauker, Martin, Robinson⟩ 

A one-tuple: ⟨Gauker⟩ 

Order matters: ⟨Gauker, Martin⟩ ≠ ⟨Martin, Gauker⟩. 
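
For readers who like to experiment, the set and tuple facts above can be checked with Python's built-in set and tuple types. This is only an illustrative sketch: the three-member set stands in for the longer list of professors, and all variable names are my own assumptions.

```python
# Finite sets and n-tuples, modelled with Python's built-in types.
# The names below are illustrative stand-ins for the objects in the text.
faculty = {"Gauker", "Martin", "Robinson"}
same_faculty = {"Robinson", "Gauker", "Martin"}    # same members, different listing order

order_is_irrelevant = (faculty == same_faculty)    # the two listings name one set
is_member = "Gauker" in faculty                    # membership
is_not_member = "Obama" not in faculty             # non-membership

empty = frozenset()                                # the empty set
set_of_empty = {empty}                             # the set containing the empty set
empty_differs = (len(empty) == 0 and len(set_of_empty) == 1)

union = {1, 5, 9} | {3, 9, 12}                     # union of two sets
is_subset = {1, 5}.issubset({1, 2, 5, 7, 9})       # subset
reflexive_subset = {1, 5}.issubset({1, 5})         # every set is a subset of itself

pair = ("Gauker", "Martin")
reversed_pair = ("Martin", "Gauker")
order_matters = (pair != reversed_pair)            # tuples are ordered
one_tuple = ("Gauker",)                            # a one-tuple, distinct from "Gauker" itself
```

Note that a frozenset is used for the empty set because Python only allows hashable (immutable) objects as members of a set.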
Functions 



A function is a thing with inputs and outputs, and for each input there is exactly 
one output. 

Square(x) = y. Square(3) = 9. x² = y. 

+(x, y) = z. +(2, 5) = 7. This can be abbreviated: x + y = z. The input is a pair 
⟨x, y⟩. 

fatherof(x) = y. fatherof(Beau) = Lloyd. 

The domain of a function is the set of things that are inputs to the function. 

The range of a function is the set of things that are outputs of the function for 
some input to the function. 

A function can be thought of as a set of pairs: 

For example, the addition function over positive integers = 

{⟨⟨1, 1⟩, 2⟩, ⟨⟨1, 2⟩, 3⟩, ⟨⟨2, 1⟩, 3⟩, ⟨⟨2, 2⟩, 4⟩, ...} 
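
The idea that a function is a set of input-output pairs can be made concrete in a few lines of Python. The sketch below lists only a finite fragment of addition (the full function is infinite), and the helper names is_function, domain, range_ and apply are hypothetical, chosen just for this illustration.

```python
# A function represented as a set of (input, output) pairs.
# Only a finite fragment of addition is listed here.
addition_fragment = {((1, 1), 2), ((1, 2), 3), ((2, 1), 3), ((2, 2), 4)}

def is_function(pairs):
    """True if no input is paired with two different outputs."""
    outputs = {}
    for inp, out in pairs:
        if inp in outputs and outputs[inp] != out:
            return False
        outputs[inp] = out
    return True

def domain(pairs):
    """The set of things that are inputs."""
    return {inp for inp, _ in pairs}

def range_(pairs):
    """The set of things that are outputs for some input."""
    return {out for _, out in pairs}

def apply(pairs, inp):
    """Look up the unique output for inp (raises KeyError outside the domain)."""
    return dict(pairs)[inp]
```

For example, apply(addition_fragment, (2, 1)) looks up the pair whose first member is (2, 1) and returns its second member.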



The identity relation on D 

The identity relation on a set D is the smallest set of ordered pairs such that for every 
object o ∈ D, ⟨o, o⟩ is a member. 

{⟨o1, o1⟩, ⟨o2, o2⟩, ⟨o3, o3⟩, ...} 



Definition of a structure 

(Other terms for structures: "model", "interpretation".) 

Each structure is a pair. 

M = ⟨D_M, Σ_M⟩, or just ⟨D, Σ⟩, for short. 

D_M, the domain of M, is a set of objects, e.g., {o7, o2, o66, ...}. 

The domain must be nonempty. 

Σ_M is an interpretation: 

For each name n of L, Σ_M(n) ∈ D_M. 

(That is, Σ_M assigns to each name a member of D_M.) 

Example: Σ_M("b") = o5, assuming o5 ∈ D_M. 

For each n-place predicate P, Σ_M(P) = a set of n-tuples whose members are all 
members of D_M. 

For example, Σ_M("Larger") = {⟨o2, o5⟩, ⟨o5, o6⟩, ⟨o2, o6⟩}, assuming that o2, o5, 
o6 are all members of D_M. 

For example, Σ_M("Cube") = {⟨o2⟩, ⟨o7⟩}. 

Σ_M("=") = the identity relation on D_M, as defined above. 
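
As a rough illustration, a finite structure can be written down as ordinary Python data and the conditions above checked mechanically. The encoding is my own assumption (a set for the domain, a dictionary with "names" and "predicates" entries for the interpretation), and is_structure is a hypothetical helper, not part of any library.

```python
# A hypothetical encoding of a structure: a domain D and an interpretation Sigma.
# Strings such as "o2" simply stand in for the objects of the domain.
D = {"o2", "o5", "o6", "o7"}
Sigma = {
    "names": {"b": "o5"},
    "predicates": {
        "Larger": {("o2", "o5"), ("o5", "o6"), ("o2", "o6")},
        "Cube": {("o2",), ("o7",)},
        "=": {(o, o) for o in D},          # the identity relation on D
    },
}

def is_structure(domain, interp):
    """Check the conditions stated above (uniform arity is not checked)."""
    if not domain:                                        # the domain must be nonempty
        return False
    if any(obj not in domain for obj in interp["names"].values()):
        return False                                      # each name denotes a member of D
    for extension in interp["predicates"].values():
        for tup in extension:
            if any(obj not in domain for obj in tup):
                return False                              # tuples are built from members of D
    return True
```

Representing the extension of a one-place predicate as a set of one-tuples, rather than a set of objects, keeps the treatment of predicates uniform across arities, just as in the definition.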

Variable assignments g in M. 

The domain of a variable assignment g in M is some subset of the set of variables of the 
language. The domain may be the empty set, ∅. 

For each variable v in the domain of g, 
g(v) ∈ D_M. 

Examples: g("x") = o3, g("z") = o7, assuming o3, o7 ∈ D_M. 

Strictly speaking, to mark the dependence of g on D_M, we should write g_M, but for 
typographical simplicity, I omit the subscript (and I will soon start omitting the subscripts 
elsewhere as well). 

Variants of variable assignments 

g[v/o] is the variable assignment just like g except that g[v/o] assigns o to v instead of 
whatever g assigns to v. 

Let's say that g[v/o] is the "v-o variant of g". 
Example: 

Suppose the domain of g is {"x", "y", "z"}, and 
g("x") = o1. 
g("y") = o2. 
g("z") = o3. 

In that case, g["y"/o4] (the "y"-o4 variant of g) is the following function: 

g["y"/o4]("x") = o1. 
g["y"/o4]("y") = o4. 
g["y"/o4]("z") = o3. 

And g["y"/o4]["z"/o2] is the following function: 

g["y"/o4]["z"/o2]("x") = o1. 
g["y"/o4]["z"/o2]("y") = o4. 
g["y"/o4]["z"/o2]("z") = o2. 

And g["y"/o4]["y"/o5] is the following function: 

g["y"/o4]["y"/o5]("x") = o1. 
g["y"/o4]["y"/o5]("y") = o5. 
g["y"/o4]["y"/o5]("z") = o3. 

In this last example, 'g["y"/o4]["y"/o5]' is the name of the function. What we write after 
the name of the function between round parentheses is the input to the function. 
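
The variant operation can be mimicked with Python dictionaries. In this sketch a variable assignment is assumed to be a dict from variable names to objects, and variant is a hypothetical helper that returns a fresh dict, so that the original assignment is never altered.

```python
def variant(g, v, o):
    """Return g[v/o]: a new assignment just like g except that v is mapped to o."""
    new_g = dict(g)    # copy, so that g itself is left unchanged
    new_g[v] = o
    return new_g

# The example from the text.
g = {"x": "o1", "y": "o2", "z": "o3"}
g_y4 = variant(g, "y", "o4")
g_y4_z2 = variant(variant(g, "y", "o4"), "z", "o2")
g_y4_y5 = variant(variant(g, "y", "o4"), "y", "o5")
```

If v is not already in the domain of g, this variant simply adds v to the domain. That is the behaviour needed when we later take variants of the empty variable assignment.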
Term assignments 

Names and variables are both called "terms". 

Where t is a term and g is some variable assignment in M, 

h(t) = Σ_M(t) if t is a name, 
h(t) = g(t) if t is a variable. 

h is a "term assignment for M and g". 

So, in other words, a term assignment h combines the functions Σ_M and g. Strictly 
speaking, to mark the dependence of h on M, we should write h_M, but to avoid notational 
clutter we will not. 

For example: 

Suppose: 

Σ_M("b") = o2. 
g("z") = o7. 

In that case, 

h("b") = o2. 
h("z") = o7. 

And, 

⟨h("b"), h("z")⟩ = ⟨Σ_M("b"), g("z")⟩ = ⟨o2, o7⟩. 

We will also have variants of such term assignments. Thus: 

h[v/o](t) = g[v/o](t) if t is a variable, 
h[v/o](t) = Σ_M(t) if t is a name. 
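
A term assignment is easy to sketch in Python. The code below assumes that names and variables are distinct strings, that the interpretation of names is a dict, and that a variable assignment is a dict; term_value and variant_term_value are hypothetical names for h and h[v/o].

```python
def term_value(t, names, g):
    """h(t): consult the interpretation if t is a name, otherwise consult g."""
    if t in names:
        return names[t]
    return g[t]    # raises KeyError if t is a variable outside the domain of g

def variant_term_value(t, names, g, v, o):
    """h[v/o](t): like h, but computed from the variant assignment g[v/o]."""
    return term_value(t, names, {**g, v: o})

# The example from the text.
names = {"b": "o2"}
g = {"z": "o7"}
pair = (term_value("b", names, g), term_value("z", names, g))
```

Taking a variant changes what variables receive but never what names receive, which is why variant_term_value("b", names, g, "z", "o3") is still "o2".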
Barwise and Etchemendy notation 

For users of Barwise and Etchemendy, I should note that the present notation differs from 
theirs in several ways. 

Instead of M, they write 𝔐. 

Instead of D_M, they write 𝔐(∀) and D^𝔐. 

Instead of Σ_M("Cube"), they write "Cube"^𝔐 (but they omit the quotation marks). 

And instead of Σ_M(P), they write P^𝔐. 

Instead of h(t), they write: ⟦t⟧^𝔐_g 

The good thing about their notation is that it makes explicit the relativity of term 
assignments to g. 

From now on, I will drop the subscript "M" on "D_M" and "Σ_M", just to reduce the clutter. 
(But don't forget that it's "really there".) 



Satisfaction of an atomic formula by a variable assignment in a structure: 

Towards defining the conditions under which a variable assignment satisfies a formula in 
a structure, we first define the conditions under which a variable assignment satisfies an 
atomic formula in a structure: 



g satisfies R(t1, t2, ..., tn) in M if and only if: 

⟨h(t1), h(t2), ..., h(tn)⟩ ∈ Σ_M(R). 
Example: "Larger(b, y)" 
Suppose M = ⟨D, Σ⟩, where 

D = {o1, o2, o3}. 

Suppose Σ("b") = o3, and 

Σ("Larger") = {⟨o2, o3⟩, ⟨o3, o1⟩, ⟨o2, o1⟩}, and 

g("y") = o1. 

In that case, ⟨h("b"), h("y")⟩ = ⟨Σ("b"), g("y")⟩ = ⟨o3, o1⟩ ∈ Σ("Larger"). 
So g satisfies "Larger(b, y)" in M. 

Another example: "Adjoins(x, y)" 
Suppose M = ⟨D, Σ⟩, where 
D = {o1, o2, o3}. 

Σ("Adjoins") = {⟨o2, o3⟩, ⟨o3, o2⟩}. 

g("x") = o1. 
g("y") = o2. 

Thus, ⟨h("x"), h("y")⟩ = ⟨g("x"), g("y")⟩ = ⟨o1, o2⟩ ∉ Σ("Adjoins"). 
So g does not satisfy "Adjoins(x, y)" in M. 
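
The two examples above can be re-checked mechanically. In this sketch satisfies_atomic is a hypothetical helper, and for convenience the interpretations from the two examples are merged into a single pair of dictionaries, which is harmless here because the examples use different predicates.

```python
def satisfies_atomic(pred, terms, names, preds, g):
    """g satisfies pred(terms) iff the tuple of term values is in the extension of pred."""
    values = tuple(names[t] if t in names else g[t] for t in terms)
    return values in preds[pred]

# Interpretation of the name and the predicates from the two examples.
names = {"b": "o3"}
preds = {
    "Larger": {("o2", "o3"), ("o3", "o1"), ("o2", "o1")},
    "Adjoins": {("o2", "o3"), ("o3", "o2")},
}

# "Larger(b, y)" with g("y") = o1.
larger_b_y = satisfies_atomic("Larger", ["b", "y"], names, preds, {"y": "o1"})

# "Adjoins(x, y)" with g("x") = o1 and g("y") = o2.
adjoins_x_y = satisfies_atomic("Adjoins", ["x", "y"], names, preds, {"x": "o1", "y": "o2"})
```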

Satisfaction of a disjunction by a variable assignment in a structure 

To illustrate the manner in which we can define the conditions under which variable 
assignments satisfy complex formulas in a structure, consider the case of disjunctions: 

g satisfies (Q ∨ R) in M if and only if either g satisfies Q in M or g satisfies R in M. 

Suppose M = ⟨D, Σ⟩, where 
D = {o1, o2, o3}. 

Σ("Cube") = {⟨o2⟩, ⟨o3⟩}. 
Σ("Tet") = {⟨o1⟩}. 
Σ("a") = o2. 

g("x") = o2. 

⟨h("a")⟩ = ⟨Σ("a")⟩ = ⟨o2⟩ ∈ Σ("Cube"). So g satisfies "Cube(a)" in M. 
⟨h("x")⟩ = ⟨g("x")⟩ = ⟨o2⟩ ∉ Σ("Tet"). So g does not satisfy "Tet(x)" in M. 
So either g satisfies "Cube(a)" in M or g satisfies "Tet(x)" in M. 
So g satisfies "(Cube(a) ∨ Tet(x))" in M. 

Satisfaction of an existential quantification by variable assignment in a structure 

To illustrate the manner in which we can define the conditions under which variable 
assignments satisfy quantified formulas in a structure, consider the case of existential 
quantifications: 

g satisfies ∃vQ in M if and only if for some o ∈ D, g[v/o] satisfies Q in M. 

Example: 

Suppose M = ⟨D, Σ⟩, where 

D = {o1, o2, o3, o4}. 

Σ("Cube") = {⟨o2⟩, ⟨o4⟩}. 

g("x") = o1. 
g["x"/o2]("x") = o2. 

⟨h["x"/o2]("x")⟩ = ⟨g["x"/o2]("x")⟩ = ⟨o2⟩ ∈ Σ("Cube"). 
So g["x"/o2] satisfies "Cube(x)" in M. 
And o2 ∈ D. 

So for some o ∈ D, g["x"/o] satisfies "Cube(x)" in M. 
So g satisfies "∃xCube(x)" in M. 

Another example: 
Suppose M = ⟨D, Σ⟩, where 
D = {o1, o2, o3, o4}. 
Σ("Adjoins") = {⟨o1, o2⟩, ⟨o2, o1⟩}. 
Σ("b") = o3. 

g("y") = o2. 

There is no object o ∈ D such that 
⟨h["y"/o]("b"), h["y"/o]("y")⟩ ∈ Σ("Adjoins"). 
For example, 

⟨h["y"/o1]("b"), h["y"/o1]("y")⟩ = 
⟨Σ("b"), g["y"/o1]("y")⟩ = 
⟨o3, o1⟩ ∉ Σ("Adjoins"). 

So, g does not satisfy "∃yAdjoins(b, y)" in M. 
Recursive Definitions 

(Also called "inductive definitions".) 

We now want to put together a number of stipulations concerning satisfaction, such as 
those above, to form a complete definition of satisfaction. The definition of satisfaction 
will be a "recursive definition". So let's first get a sense of what those are like: 

Some ordinary definitions: 

A number x is a positive prime if and only if x is greater than 1 and x is divisible only 
by 1 and itself. 

x is a shoathanger if and only if either (i) x is a sheep or (ii) x is a coat hanger. 



A circular definition (bad! not really a "definition" at all): 

A thing x is a schmuck if and only if either (i) x is a liar or (ii) x is a friend of a 
schmuck. 

The problem with this "definition" is that if something is not a liar, and none of its 
friends are liars, and none of the friends of its friends are liars, and so on, then we can 
draw no conclusions about whether it is a schmuck or not. 



A recursive definition: 



S is a string if and only if either 

(i) S = "o", or 

(ii) S = "o" followed by a string. 



So, "o" is a string. 

"oo" is a string, since "o" is a string and "oo" = "o" followed by a string, namely, 
"o". 

"ooo" is a string, since "ooo" = "o" followed by a string, namely, "oo". 
And so on. 

But "ooxo" is not a string, because it is not "o" followed by a string: "oxo" is 
not a string, because it is not "o" followed by a string, since "xo" is not a string, 
because it is neither "o" nor "o" followed by a string. 
Notice in this example that, although the term "string" occurs on the right-hand side, the 
definition is not circular, because whenever we want to know whether something is a 
string, we get back to the question of whether something is "o" or starts with "o", which 
is a question we can answer just by looking. 
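
The recursive definition of "string" translates almost word for word into a recursive Python function. This is a minimal sketch; is_string is a hypothetical name, and each recursive call examines a strictly shorter piece of text, which is why the procedure always terminates.

```python
def is_string(s):
    """S is a string iff S is "o", or S is "o" followed by a string."""
    if s == "o":                 # clause (i): the base case
        return True
    if s.startswith("o"):        # clause (ii): "o" followed by something ...
        return is_string(s[1:])  # ... which must itself be a string
    return False
```

Tracing is_string("ooxo") reproduces the reasoning in the text: the question reduces to "oxo", then to "xo", which is neither "o" nor something beginning with "o".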



The Definition of Satisfaction (ta da!): 

For every wff P and every structure M and every variable assignment g in M, g satisfies 
P in M if and only if either: 

(i) P = R(t1, t2, ..., tn), where R is an n-ary predicate and t1, t2, ..., tn are n terms, 
and ⟨h(t1), h(t2), ..., h(tn)⟩ ∈ Σ_M(R), or 

(ii) P = ¬Q, where Q is a wff, and 
g does not satisfy Q in M, or 

(iii) P = (Q ∧ R), where Q and R are wffs, and 
both g satisfies Q in M and g satisfies R in M, or 

(iv) P = (Q ∨ R), where Q and R are wffs, and 
either g satisfies Q in M or g satisfies R in M, or 

(v) P = (Q → R), where Q and R are wffs, and 
either g does not satisfy Q in M or g satisfies R in M, or 

(vi) P = (Q ↔ R), where Q and R are wffs, and 
either g satisfies both Q and R in M or g satisfies neither Q nor R in M, or 

(vii) P = ∀vQ, where Q is a wff, and 
for every object o ∈ D_M, g[v/o] satisfies Q in M, or 

(viii) P = ∃vQ, where Q is a wff, and 
for some object o ∈ D_M, g[v/o] satisfies Q in M. 
An alternative formulation 

Suppose P is a wff, M is a structure, and g is a variable assignment in M. 

1. Suppose P = R(t1, t2, ..., tn), where R is an n-ary predicate and t1, t2, ..., tn are n 
terms. 
Then g satisfies P in M if and only if ⟨h(t1), h(t2), ..., h(tn)⟩ ∈ Σ_M(R). 

2. Suppose P = ¬Q, where Q is a wff. 
Then g satisfies P in M if and only if g does not satisfy Q in M. 

3. Suppose P = (Q ∧ R), where Q and R are wffs. 
Then g satisfies P in M if and only if 
both g satisfies Q in M and g satisfies R in M. 

4. Suppose P = (Q ∨ R), where Q and R are wffs. 
Then g satisfies P in M if and only if 
either g satisfies Q in M or g satisfies R in M. 

5. Suppose P = (Q → R), where Q and R are wffs. 
Then g satisfies P in M if and only if 
either g does not satisfy Q in M or g satisfies R in M. 

6. Suppose P = (Q ↔ R), where Q and R are wffs. 
Then g satisfies P in M if and only if either g satisfies both Q and R in M, or g 
satisfies neither Q nor R in M. 

7. Suppose P = ∀vQ, where Q is a wff. 
Then g satisfies P in M if and only if 
for every object o ∈ D_M, g[v/o] satisfies Q in M. 

8. Suppose P = ∃vQ, where Q is a wff. 
Then g satisfies P in M if and only if 
for some object o ∈ D_M, g[v/o] satisfies Q in M. 

We can call this the "definition of satisfaction" too, although actually it is a list of 
"axioms". 

The empty variable assignment, g_∅, is the variable assignment whose domain is ∅. 
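
For finite domains, the eight clauses can be turned into a small recursive evaluator. What follows is only a sketch under assumptions of my own: formulas are encoded as nested Python tuples such as ("some", "x", ("atom", "Cube", ("x",))), assignments are dicts, and the function name satisfies and the operator tags are hypothetical. The quantifier clauses loop over the whole domain, so the code cannot handle infinite structures.

```python
def satisfies(g, P, D, names, preds):
    """Return True iff assignment g satisfies formula P in the structure (D, Sigma).

    names and preds together play the role of the interpretation Sigma.
    """
    op = P[0]
    if op == "atom":                                   # clause 1
        _, R, terms = P
        values = tuple(names[t] if t in names else g[t] for t in terms)
        return values in preds[R]
    if op == "not":                                    # clause 2
        return not satisfies(g, P[1], D, names, preds)
    if op == "and":                                    # clause 3
        return satisfies(g, P[1], D, names, preds) and satisfies(g, P[2], D, names, preds)
    if op == "or":                                     # clause 4
        return satisfies(g, P[1], D, names, preds) or satisfies(g, P[2], D, names, preds)
    if op == "imp":                                    # clause 5
        return (not satisfies(g, P[1], D, names, preds)) or satisfies(g, P[2], D, names, preds)
    if op == "iff":                                    # clause 6
        return satisfies(g, P[1], D, names, preds) == satisfies(g, P[2], D, names, preds)
    if op == "all":                                    # clause 7
        _, v, Q = P
        return all(satisfies({**g, v: o}, Q, D, names, preds) for o in D)
    if op == "some":                                   # clause 8
        _, v, Q = P
        return any(satisfies({**g, v: o}, Q, D, names, preds) for o in D)
    raise ValueError(f"unknown formula form: {op!r}")

# The disjunction example from earlier in the lesson.
D = {"o1", "o2", "o3"}
names = {"a": "o2"}
preds = {"Cube": {("o2",), ("o3",)}, "Tet": {("o1",)}}
g = {"x": "o2"}
disjunction = ("or", ("atom", "Cube", ("a",)), ("atom", "Tet", ("x",)))
exists_cube = ("some", "x", ("atom", "Cube", ("x",)))
g_empty = {}    # the empty variable assignment
```

The expression {**g, v: o} builds the variant g[v/o] without modifying g. Calling satisfies(g_empty, exists_cube, D, names, preds) shows how a quantified sentence can be evaluated starting from the empty variable assignment, since the quantifier clause supplies the values for the variable.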
The definition of truth in a structure: 

A sentence S is true in a structure M if and only if 
the empty variable assignment g_∅ satisfies S in M. 

Students often find this definition troubling. How can anything of importance depend on 
the empty variable assignment? Well, the empty variable assignment gives us a "hook" 
on which to spin variations as we consider the subformulas of the sentence in question. 

Example: 

Suppose M = ⟨D, Σ⟩, where 

D = {o1, o2}, 
Σ("Cube") = {⟨o1⟩}, 
Σ("Tet") = ∅. 

So ⟨o1⟩ ∈ Σ("Cube"). 

So ⟨g_∅["x"/o1]("x")⟩ ∈ Σ("Cube"). 

So ⟨h_∅["x"/o1]("x")⟩ ∈ Σ("Cube"). 

By clause 1 in the def. of satisfaction, g_∅["x"/o1] satisfies "Cube(x)" in M. 
So there exists an o ∈ D such that g_∅["x"/o] satisfies "Cube(x)" in M. 
By clause 8 in the def. of satisfaction, g_∅ satisfies "∃xCube(x)" in M. 
So "∃xCube(x)" is true in M. 

⟨o1⟩ ∉ Σ("Tet"). 
⟨g_∅["x"/o1]("x")⟩ ∉ Σ("Tet"). 
⟨h_∅["x"/o1]("x")⟩ ∉ Σ("Tet"). 

By clause 1, g_∅["x"/o1] does not satisfy "Tet(x)" in M. 

⟨o2⟩ ∉ Σ("Tet"). 
⟨g_∅["x"/o2]("x")⟩ ∉ Σ("Tet"). 
⟨h_∅["x"/o2]("x")⟩ ∉ Σ("Tet"). 

By clause 1, g_∅["x"/o2] does not satisfy "Tet(x)" in M. 

So there is no o ∈ D such that g_∅["x"/o] satisfies "Tet(x)" in M. 

By clause 8, g_∅ does not satisfy "∃xTet(x)" in M. 

So "∃xTet(x)" is not true in M. 
More examples: 

Suppose M = ⟨D, Σ⟩, where 
D = {o1, o2}, 
Σ("Cube") = {⟨o1⟩}, 
Σ("a") = o1. Σ("b") = o2. 

So ⟨o1⟩ ∈ Σ("Cube"). 

So ⟨Σ("a")⟩ ∈ Σ("Cube"). 

So ⟨h_∅("a")⟩ ∈ Σ("Cube"). 

By clause 1, g_∅ satisfies "Cube(a)" in M. 

So by the definition of truth, "Cube(a)" is true in M. 

⟨o2⟩ ∉ Σ("Cube"). 

So ⟨Σ("b")⟩ ∉ Σ("Cube"). 

So ⟨h_∅("b")⟩ ∉ Σ("Cube"). 

By clause 1, g_∅ does not satisfy "Cube(b)" in M. 

So by the definition of truth, "Cube(b)" is not true in M. 

By clause 2, g_∅ satisfies "¬Cube(b)" in M. 

So by the definition of truth, "¬Cube(b)" is true in M. 

Suppose M = ⟨D, Σ⟩, where 
D = {o1, o2}, 
Σ("Sameshape") = {⟨o1, o1⟩, ⟨o2, o2⟩, ⟨o1, o2⟩, ⟨o2, o1⟩}, 
Σ("a") = o1. 

⟨o1, o1⟩ ∈ Σ("Sameshape"). 
⟨g_∅["x"/o1]("x"), Σ("a")⟩ ∈ Σ("Sameshape"). 
⟨h_∅["x"/o1]("x"), h_∅["x"/o1]("a")⟩ ∈ Σ("Sameshape"). 
By clause 1, g_∅["x"/o1] satisfies "Sameshape(x, a)" in M. 

⟨o2, o1⟩ ∈ Σ("Sameshape"). 
⟨g_∅["x"/o2]("x"), Σ("a")⟩ ∈ Σ("Sameshape"). 
⟨h_∅["x"/o2]("x"), h_∅["x"/o2]("a")⟩ ∈ Σ("Sameshape"). 
By clause 1, g_∅["x"/o2] satisfies "Sameshape(x, a)" in M. 

So for all o ∈ D, g_∅["x"/o] satisfies "Sameshape(x, a)" in M. 

So by clause 7, g_∅ satisfies "∀xSameshape(x, a)" in M. 

So by the definition of truth, "∀xSameshape(x, a)" is true in M. 
The definition of first-order validity: 

An argument is first-order valid (i.e., the conclusion is a first-order consequence of the 
premises) if and only if for each structure M of the language, if the premises are all true 
in M, then the conclusion is true in M too. 

Examples: 

Example 1: The inference from "∃xCube(x)" to "Cube(b)" is not first-order valid. 

Suppose M = ⟨D, Σ⟩, where 
D = {o1, o2}, 
Σ("Cube") = {⟨o1⟩}, 
Σ("b") = o2. 

We have already seen that "∃xCube(x)" is true in M, and we have already seen that 
"Cube(b)" is not true in M. 

Example 2: The inference from "Cube(b)" to "∃xCube(x)" is first-order valid: 

Suppose, for arbitrary structure M, "Cube(b)" is true in M. 

By the definition of truth, g_∅ satisfies "Cube(b)" in M. 

By clause 1 in the definition of satisfaction, ⟨h_∅("b")⟩ ∈ Σ("Cube"). 

So there is an object o ∈ D such that ⟨o⟩ ∈ Σ("Cube"). 

So there is an o ∈ D such that ⟨h_∅["x"/o]("x")⟩ ∈ Σ("Cube"). 

So by clause 1, there is an o ∈ D such that g_∅["x"/o] satisfies "Cube(x)" in M. 

So by clause 8, g_∅ satisfies "∃xCube(x)" in M. 

So "∃xCube(x)" is true in M. 

But M was arbitrary. 

So for all structures M, if "Cube(b)" is true in M, then "∃xCube(x)" is true in M as well. 

Example 3: Prove that the following argument is first-order valid: 

∀x(F(x) → G(x)) 
∃xF(x) 
---------- 
∃xG(x) 
Suppose "∀x(F(x) → G(x))" and "∃xF(x)" are true in arbitrary M. 

g_∅ satisfies "∀x(F(x) → G(x))" and "∃xF(x)" in M. 

For some o ∈ D, g_∅["x"/o] satisfies "F(x)" in M. 

Suppose a ∈ D and g_∅["x"/a] satisfies "F(x)" in M. 

For all o ∈ D, g_∅["x"/o] satisfies "(F(x) → G(x))" in M. 

So, g_∅["x"/a] satisfies "(F(x) → G(x))" in M. 

Either g_∅["x"/a] does not satisfy "F(x)" or g_∅["x"/a] satisfies "G(x)" in M. 

So, g_∅["x"/a] satisfies "G(x)" in M. 

So, for some o ∈ D, g_∅["x"/o] satisfies "G(x)" in M. 

g_∅ satisfies "∃xG(x)" in M. 

"∃xG(x)" is true in M. 



Example 4: Prove that the following argument is not first-order valid: 
(∃xF(x) ∧ ∃xG(x)) 
---------- 
∃x(F(x) ∧ G(x)) 

Suppose M = ⟨D, Σ⟩, where 
D = {a, b}, and 
Σ("F") = {⟨a⟩}, and 
Σ("G") = {⟨b⟩}. 

⟨g_∅["x"/a]("x")⟩ ∈ Σ("F"). 

⟨h_∅["x"/a]("x")⟩ ∈ Σ("F"). 

g_∅["x"/a] satisfies "F(x)" in M. 

For some o ∈ D, g_∅["x"/o] satisfies "F(x)" in M. 

g_∅ satisfies "∃xF(x)" in M. 

⟨g_∅["x"/b]("x")⟩ ∈ Σ("G"). 

⟨h_∅["x"/b]("x")⟩ ∈ Σ("G"). 

g_∅["x"/b] satisfies "G(x)" in M. 

For some o ∈ D, g_∅["x"/o] satisfies "G(x)" in M. 

g_∅ satisfies "∃xG(x)" in M. 

g_∅ satisfies "∃xF(x)" in M and g_∅ satisfies "∃xG(x)" in M. 
g_∅ satisfies "(∃xF(x) ∧ ∃xG(x))" in M. 
"(∃xF(x) ∧ ∃xG(x))" is true in M. 

⟨g_∅["x"/b]("x")⟩ ∉ Σ("F"). 

⟨h_∅["x"/b]("x")⟩ ∉ Σ("F"). 

g_∅["x"/b] does not satisfy "F(x)" in M. 

g_∅["x"/b] does not satisfy "(F(x) ∧ G(x))" in M. 

⟨g_∅["x"/a]("x")⟩ ∉ Σ("G"). 

⟨h_∅["x"/a]("x")⟩ ∉ Σ("G"). 

g_∅["x"/a] does not satisfy "G(x)" in M. 

g_∅["x"/a] does not satisfy "(F(x) ∧ G(x))" in M. 

For all o ∈ D, g_∅["x"/o] does not satisfy "(F(x) ∧ G(x))" in M. 
g_∅ does not satisfy "∃x(F(x) ∧ G(x))" in M. 
"∃x(F(x) ∧ G(x))" is not true in M. 



NOTE: In all subsequent examples, I will omit all quotation marks. However, do not 
forget that "really" they are there. 

Example 5: Show that the following argument is first-order valid: 
∃x∀yR(x, y) 
----------- 
∀y∃xR(x, y) 

Suppose ∃x∀yR(x, y) is true in arbitrary M. 
g∅ satisfies ∃x∀yR(x, y) in M. 
For some o ∈ D, g∅[x/o] satisfies ∀yR(x, y) in M. 
Suppose a ∈ D and g∅[x/a] satisfies ∀yR(x, y) in M. 

For all o ∈ D, g∅[x/a][y/o] satisfies R(x, y) in M. 

So, for all o ∈ D, g∅[y/o][x/a] satisfies R(x, y) in M. (Notice the difference.) 

For all o ∈ D, there is some o' ∈ D such that g∅[y/o][x/o'] satisfies R(x, y) in M. 
For all o ∈ D, g∅[y/o] satisfies ∃xR(x, y) in M. 
g∅ satisfies ∀y∃xR(x, y) in M. 

∀y∃xR(x, y) is true in M. 

Example 6: Show that the following argument is not first-order valid: 
∀x∃yR(x, y) 
----------- 
∃y∀xR(x, y) 

Suppose M = ⟨D, Σ⟩, where 
D = {a, b}, and 
Σ(R) = {⟨a, a⟩, ⟨b, b⟩}. 

⟨g∅[x/a][y/a](x), g∅[x/a][y/a](y)⟩ ∈ Σ(R). 
g∅[x/a][y/a] satisfies R(x, y) in M. 
g∅[x/a] satisfies ∃yR(x, y) in M. 

⟨g∅[x/b][y/b](x), g∅[x/b][y/b](y)⟩ ∈ Σ(R). 
g∅[x/b][y/b] satisfies R(x, y) in M. 
g∅[x/b] satisfies ∃yR(x, y) in M. 

For all o ∈ D, g∅[x/o] satisfies ∃yR(x, y) in M. 
g∅ satisfies ∀x∃yR(x, y) in M. 
∀x∃yR(x, y) is true in M. 

⟨g∅[y/a][x/b](x), g∅[y/a][x/b](y)⟩ ∉ Σ(R). 
g∅[y/a][x/b] does not satisfy R(x, y) in M. 

It is not the case that for all o ∈ D, g∅[y/a][x/o] satisfies R(x, y) in M. 
g∅[y/a] does not satisfy ∀xR(x, y) in M. 

⟨g∅[y/b][x/a](x), g∅[y/b][x/a](y)⟩ ∉ Σ(R). 
g∅[y/b][x/a] does not satisfy R(x, y) in M. 

It is not the case that for all o ∈ D, g∅[y/b][x/o] satisfies R(x, y) in M. 
g∅[y/b] does not satisfy ∀xR(x, y) in M. 

For all o ∈ D, g∅[y/o] does not satisfy ∀xR(x, y) in M. 
g∅ does not satisfy ∃y∀xR(x, y) in M. 
∃y∀xR(x, y) is not true in M. 

Example 7: Show that the following argument is first-order valid: 
¬∃x(F(x) ∧ G(x)) 
---------------- 
∀x(F(x) → ¬G(x)) 

Suppose, for arbitrary M, that ¬∃x(F(x) ∧ G(x)) is true in M. 

g∅ satisfies ¬∃x(F(x) ∧ G(x)) in M. 

g∅ does not satisfy ∃x(F(x) ∧ G(x)) in M. 

There is no object o ∈ D such that g∅[x/o] satisfies (F(x) ∧ G(x)) in M. 

There is no object o ∈ D such that both g∅[x/o] satisfies F(x) in M and g∅[x/o] satisfies 
G(x) in M. 

For all objects o ∈ D, either g∅[x/o] does not satisfy F(x) in M or g∅[x/o] does not satisfy 
G(x) in M. 

For all objects o ∈ D, either g∅[x/o] does not satisfy F(x) in M or g∅[x/o] satisfies ¬G(x) 
in M. 

For all objects o ∈ D, g∅[x/o] satisfies (F(x) → ¬G(x)) in M. 
g∅ satisfies ∀x(F(x) → ¬G(x)) in M. 
∀x(F(x) → ¬G(x)) is true in M. 



Example 8: Show that the following argument is not first-order valid. 
(∀xF(x) → G(c)) 
--------------- 
∀x(F(x) → G(c)) 

Suppose M = ⟨D, Σ⟩, where 
D = {a, b}, and 
Σ(F) = {⟨a⟩}, and 
Σ(G) = {⟨b⟩}, and 
Σ(c) = a. 

⟨g∅[x/b](x)⟩ ∉ Σ(F). 

g∅[x/b] does not satisfy F(x) in M. 

For some o ∈ D, g∅[x/o] does not satisfy F(x) in M. 

g∅ does not satisfy ∀xF(x) in M. 

Either g∅ does not satisfy ∀xF(x) in M or g∅ satisfies G(c) in M. 
g∅ satisfies (∀xF(x) → G(c)) in M. 
(∀xF(x) → G(c)) is true in M. 

⟨g∅[x/a](x)⟩ ∈ Σ(F). 

g∅[x/a] satisfies F(x) in M. 

⟨Σ(c)⟩ ∉ Σ(G). 

⟨h∅[x/a](c)⟩ ∉ Σ(G). 

g∅[x/a] does not satisfy G(c) in M. 

It is not the case that either g∅[x/a] does not satisfy F(x) in M or g∅[x/a] satisfies G(c) 
in M. 

g∅[x/a] does not satisfy (F(x) → G(c)) in M. 

It is not the case that for all o ∈ D, g∅[x/o] satisfies (F(x) → G(c)) in M. 
g∅ does not satisfy ∀x(F(x) → G(c)) in M. 
∀x(F(x) → G(c)) is not true in M. 



Example 9: Show that the following argument is not first-order valid. 
∀xLikes(x, d) 
------------- 
∀xLikes(x, x) 

Suppose M = ⟨D, Σ⟩, where 
D = {a, b}, and 
Σ(Likes) = {⟨a, a⟩, ⟨b, a⟩}, and 
Σ(d) = a. 

⟨g∅[x/a](x), Σ(d)⟩ ∈ Σ(Likes). 
⟨h∅[x/a](x), h∅[x/a](d)⟩ ∈ Σ(Likes). 
So g∅[x/a] satisfies Likes(x, d). 
⟨g∅[x/b](x), Σ(d)⟩ ∈ Σ(Likes). 
⟨h∅[x/b](x), h∅[x/b](d)⟩ ∈ Σ(Likes). 
So g∅[x/b] satisfies Likes(x, d). 
For all o ∈ D, g∅[x/o] satisfies Likes(x, d). 
g∅ satisfies ∀xLikes(x, d). 
∀xLikes(x, d) is true in M. 

⟨g∅[x/b](x), g∅[x/b](x)⟩ ∉ Σ(Likes). 

g∅[x/b] does not satisfy Likes(x, x). 

For some o ∈ D, g∅[x/o] does not satisfy Likes(x, x). 

g∅ does not satisfy ∀xLikes(x, x). 

∀xLikes(x, x) is not true in M. 
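
The countermodels in the examples above are small enough to check mechanically. The following Python sketch is not part of the course's official apparatus; it is one possible way to implement satisfaction by a variable assignment in a finite structure. The tuple encoding of formulas and the names denote, satisfies and true_in are assumptions made for this illustration only.

```python
# Formulas are nested tuples (an encoding assumed here for illustration):
#   ("pred", "F", ("x",))          the atomic formula F(x)
#   ("not", p), ("and", p, q), ("imp", p, q)
#   ("all", "x", p), ("ex", "x", p)

def denote(term, sigma, g):
    """Value of a term: variables are looked up in g, names in sigma."""
    return g[term] if term in g else sigma[term]

def satisfies(g, formula, domain, sigma):
    """True if the variable assignment g satisfies formula in M = <domain, sigma>."""
    op = formula[0]
    if op == "pred":
        _, predicate, terms = formula
        return tuple(denote(t, sigma, g) for t in terms) in sigma[predicate]
    if op == "not":
        return not satisfies(g, formula[1], domain, sigma)
    if op == "and":
        return (satisfies(g, formula[1], domain, sigma)
                and satisfies(g, formula[2], domain, sigma))
    if op == "imp":
        return (not satisfies(g, formula[1], domain, sigma)
                or satisfies(g, formula[2], domain, sigma))
    if op in ("all", "ex"):
        _, var, body = formula
        # {**g, var: o} plays the role of g[var/o]: like g, except var gets o.
        results = [satisfies({**g, var: o}, body, domain, sigma) for o in domain]
        return all(results) if op == "all" else any(results)
    raise ValueError(f"unknown operator: {op!r}")

def true_in(sentence, domain, sigma):
    """A sentence is true in M iff the empty assignment satisfies it in M."""
    return satisfies({}, sentence, domain, sigma)

def F(t): return ("pred", "F", (t,))
def G(t): return ("pred", "G", (t,))
def R(s, t): return ("pred", "R", (s, t))

D = ["a", "b"]

# Example 4: the premise is true in M, the conclusion is not.
sigma4 = {"F": {("a",)}, "G": {("b",)}}
premise4 = ("and", ("ex", "x", F("x")), ("ex", "x", G("x")))
conclusion4 = ("ex", "x", ("and", F("x"), G("x")))
print(true_in(premise4, D, sigma4), true_in(conclusion4, D, sigma4))

# Example 6: the premise is true in M, the conclusion is not.
sigma6 = {"R": {("a", "a"), ("b", "b")}}
premise6 = ("all", "x", ("ex", "y", R("x", "y")))
conclusion6 = ("ex", "y", ("all", "x", R("x", "y")))
print(true_in(premise6, D, sigma6), true_in(conclusion6, D, sigma6))
```

A check of this kind can confirm that a particular finite structure is a countermodel, but it cannot establish validity, since validity quantifies over all structures.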



Lesson 2: The Soundness Theorem for First-order Logic 



Proof by induction 

In proving general claims about sentences, arguments, etc., we will sometimes make use 
of the method of proof by induction, or inductive proof. So I want to begin by explaining 
that concept. I will start with a simple example, and then I will generalize in a vague way 
from that. 

The example I will use has to do with three-valued logic. So first, before I give you the 
example of an inductive proof, I need to say a little about that. 

Suppose we are dealing with a language L containing atomic sentences, the negation sign 
and the disjunction sign. Since we don't have quantifiers, we can ignore names and 
predicates and suppose that the atomic sentences are A, B, C, .... Sentences are built up 
from these atomic sentences in the usual way. 

Say that atomic sentences have complexity 0. 

If a sentence P has complexity n, then ¬P has complexity n+1. 

If out of the two sentences P and Q, the complexity of the one with the greatest 
complexity is n, then the complexity of (P ∨ Q) is n+1. 

For example, since A is an atomic sentence, ¬A has complexity 1. 
The complexity of (B ∨ C) is 1. 

The complexity of ((A ∨ B) ∨ ¬(B ∨ C)) is 3 (not 2, not 4!). Can you see why? 
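
The definition of complexity is recursive, so it can be written directly as a recursive function. The Python sketch below is only an illustration; representing atomic sentences as strings and compound sentences as tuples tagged "not" and "or" is an assumption of the sketch, not something fixed by these notes.

```python
def complexity(sentence):
    """Complexity of a sentence built from atoms with negation and disjunction."""
    if isinstance(sentence, str):          # atomic sentences have complexity 0
        return 0
    if sentence[0] == "not":               # the negation of P: one more than P
        return complexity(sentence[1]) + 1
    if sentence[0] == "or":                # (P or Q): one more than the greater of the two
        return max(complexity(sentence[1]), complexity(sentence[2])) + 1
    raise ValueError(f"unknown connective: {sentence[0]!r}")

# The three examples from the text.
not_a = ("not", "A")
b_or_c = ("or", "B", "C")
big = ("or", ("or", "A", "B"), ("not", ("or", "B", "C")))
print(complexity(not_a), complexity(b_or_c), complexity(big))  # prints: 1 1 3
```

The last sentence has complexity 3 because its right disjunct, the negation of a disjunction, already has complexity 2.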

Say that a three-valued assignment σ for L is a function that takes atomic sentences of L 
as inputs and yields as outputs a member of the set {T, N, F}. "N" stands for neither. 
We think of sentences to which N is assigned as neither true nor false. 

Let the truth tables for ¬ and ∨ be as follows: 

 P | ¬P 
---+---- 
 T | F 
 N | N 
 F | T 

(P ∨ Q) | Q = T | Q = N | Q = F 
--------+-------+-------+------ 
 P = T  |   T   |   T   |   T 
 P = N  |   T   |   N   |   N 
 P = F  |   T   |   N   |   F 

These tables simply display in graphic form the following definition of an evaluation Vσ. 

For every three-valued assignment σ and every sentence S of L, 

(i) if S is atomic, then Vσ(S) = σ(S), and 

(ii) if S = ¬P, then 

(a) Vσ(S) = T if and only if Vσ(P) = F, and 

(b) Vσ(S) = F if and only if Vσ(P) = T, and 

(c) Vσ(S) = N if and only if Vσ(P) = N, and 

(iii) if S = (P ∨ Q), then 

(a) Vσ(S) = T if and only if Vσ(P) = T or Vσ(Q) = T, and 

(b) Vσ(S) = F if and only if Vσ(P) = F and Vσ(Q) = F, and 

(c) Vσ(S) = N in every other case. 
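
Clauses (i) through (iii) translate almost line for line into a recursive function. The following Python sketch is offered only as an illustration; the tuple encoding of sentences and the function name evaluate are assumptions of the sketch.

```python
def evaluate(sentence, sigma):
    """The evaluation determined by the three-valued assignment sigma (a dict)."""
    if isinstance(sentence, str):                      # clause (i): atomic sentences
        return sigma[sentence]
    if sentence[0] == "not":                           # clause (ii): negation
        return {"T": "F", "F": "T", "N": "N"}[evaluate(sentence[1], sigma)]
    if sentence[0] == "or":                            # clause (iii): disjunction
        p = evaluate(sentence[1], sigma)
        q = evaluate(sentence[2], sigma)
        if p == "T" or q == "T":
            return "T"
        if p == "F" and q == "F":
            return "F"
        return "N"                                     # every other case
    raise ValueError(f"unknown connective: {sentence[0]!r}")

sigma = {"A": "T", "B": "N", "C": "F"}
print(evaluate(("or", "A", "B"), sigma))   # a true disjunct suffices
print(evaluate(("or", "B", "C"), sigma))   # neither true, not both false
print(evaluate(("not", "B"), sigma))       # negation preserves N
```

Under the assignment shown, the three lines printed are T, N and N.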



Now I state the following theorem: 



Theorem: Suppose σ and σ* are three-valued assignments for language L. 
And for all atomic sentences P of L, 

if σ(P) = T, then σ*(P) = T, and 

if σ(P) = F, then σ*(P) = F. 

(That much is the "hypothesis" of the theorem.) 
Then for all sentences P of L (what follows is "the thesis"), 

if Vσ(P) = T, then Vσ*(P) = T, and 

if Vσ(P) = F, then Vσ*(P) = F. 

In other words, σ* does not change any of the assignments of T or F that σ makes, but σ* may 
assign T or F to more atomic sentences than σ does. Nonetheless, the theorem states that 
the switch from σ to σ* creates no new truth-value gaps among the compound sentences. 

Proof: By induction "on the complexity of sentences". 

Assume the hypothesis of the theorem (viz., that for all atomic sentences, . . .). 

Basis: Show that the thesis holds for all atomic sentences (sentences of complexity 0). 
For each sentence P of complexity 0, Vσ(P) = σ(P) and Vσ*(P) = σ*(P). So since σ* 
assigns T (or F) wherever σ does, Vσ* assigns T (or F) wherever Vσ does. In other words, 
the hypothesis, restricted to atomic sentences, implies the thesis, restricted to atomic 
sentences. (The basis clause for an inductive proof will not always be as easy as this!) 

Induction hypothesis (IH): Suppose that the thesis holds for all sentences having a 
complexity less than or equal to n. 

Induction step: Show that the thesis holds for all sentences having complexity equal to 
n+1. 

(¬) Suppose P = ¬Q has complexity n+1 and Vσ(¬Q) = T. Then Vσ(Q) = F. But Q has 
complexity n; so by the induction hypothesis, Vσ*(Q) = F. So Vσ*(¬Q) = T. 

Suppose P = ¬Q has complexity n+1 and Vσ(¬Q) = F. Then Vσ(Q) = T. But Q has 
complexity n; so by the induction hypothesis, Vσ*(Q) = T. So Vσ*(¬Q) = F. 

(∨) Suppose P = (Q ∨ R) has complexity n+1 and Vσ((Q ∨ R)) = T. Then either Vσ(Q) 
= T or Vσ(R) = T. But either Q or R has complexity n, and the other has complexity 
less than or equal to n; so by the induction hypothesis, either Vσ*(Q) = T or Vσ*(R) = 
T. So Vσ*((Q ∨ R)) = T. 

Suppose P = (Q ∨ R) has complexity n+1 and Vσ((Q ∨ R)) = F. Then both Vσ(Q) = 
F and Vσ(R) = F. But either Q or R has complexity n, and the other has complexity 
less than or equal to n; so by the induction hypothesis, both Vσ*(Q) = F and Vσ*(R) = 
F. So Vσ*((Q ∨ R)) = F. 

End of Proof (In other words, at this point, we consider the theorem proved.) 
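
An inductive proof covers all sentences at once, but it can be reassuring to see the theorem confirmed on a small finite fragment. The Python sketch below is only a sanity check, not a substitute for the proof: it enumerates every sentence of complexity at most 2 over two atoms and every pair of assignments meeting the hypothesis. The encoding and all function names are assumptions of the sketch.

```python
from itertools import product

VALUES = ("T", "N", "F")

def evaluate(sentence, sigma):
    """Three-valued evaluation, following clauses (i)-(iii) of the definition."""
    if isinstance(sentence, str):
        return sigma[sentence]
    if sentence[0] == "not":
        return {"T": "F", "F": "T", "N": "N"}[evaluate(sentence[1], sigma)]
    p, q = evaluate(sentence[1], sigma), evaluate(sentence[2], sigma)
    if p == "T" or q == "T":
        return "T"
    return "F" if p == "F" and q == "F" else "N"

def sentences(atoms, max_complexity):
    """All sentences over atoms with complexity at most max_complexity."""
    level = list(atoms)
    for _ in range(max_complexity):
        level = (level
                 + [("not", p) for p in level]
                 + [("or", p, q) for p in level for q in level])
    return level

def meets_hypothesis(sigma, sigma_star):
    """sigma_star keeps every T and every F that sigma assigns to an atom."""
    return all(sigma[a] == "N" or sigma_star[a] == sigma[a] for a in sigma)

def theorem_holds(atoms=("A", "B"), max_complexity=2):
    pool = sentences(atoms, max_complexity)
    assignments = [dict(zip(atoms, vals))
                   for vals in product(VALUES, repeat=len(atoms))]
    for sigma, sigma_star in product(assignments, repeat=2):
        if not meets_hypothesis(sigma, sigma_star):
            continue
        for s in pool:
            value = evaluate(s, sigma)
            # The thesis: a T or F under sigma must survive under sigma_star.
            if value != "N" and evaluate(s, sigma_star) != value:
                return False
    return True

print(theorem_holds())  # prints: True
```

The check passes for this fragment, as the theorem says it must; only the inductive argument shows that it passes for sentences of every complexity.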

So here is the general pattern of a proof by induction: 

When we are doing a proof by induction, we are always trying to say something about all 
of the members of a certain set of objects. (In the example, it is the set of 
sentences in L.) Moreover, the membership of that set is defined "inductively". That is, 
we begin the definition of the set by stipulating that certain basic objects belong to the 
set. (In the example, those are the atomic sentences.) Then we say that if a certain number 
of things that we already know are in the set generate some other object, via one or 
another given member-generating function, then that other object is in the set too. And 
that gives us the entire membership of the set. (In the example, the functions are 
"sentence-forming operations", which, given one or more sentences, generate another 
sentence.) 

If a set is defined inductively in this way, then we can prove that a thesis is true of every 
member of the set as follows: First, we show that the thesis holds for all basic members. 
That step is called the basis. (In the example, the hypothesis of the theorem itself 
supplied this part.) Then we suppose, for the sake of argument, that the thesis holds for some 
arbitrary members of the set, which for an arbitrary m we can also characterize as 
members that are generated by no more than m applications of the member-generating 
functions. That is the induction hypothesis. (In the example, we supposed that the thesis 
held for all sentences having complexity less than or equal to n. The maximum number 
of applications of sentence-forming operations that are necessary to form a sentence 
having complexity n is (2^n - 1) (try it!). So, in effect, we are stipulating a maximum 
number of sentence-forming operations.) Finally, we examine each of the 
member-generating functions and show that if we apply that function to the members of the set of 
which we are supposing the thesis holds, then the thesis holds as well for the members of 
the set that that function yields. That's the induction step. (In the example, we supposed 
that the thesis held for sentences having a maximum complexity of n and showed that it 
holds for sentences having a maximum complexity of n+1, i.e., sentences that result from 
one more application of a sentence-forming operation.) On this basis, we conclude that 
the thesis holds for every member of the set. 

Question: What gives us the right to assume that an inductive proof of this kind proves 
the theorem? Is that a logical truth? No, it's a fact about inductively defined sets (a fact 
of set theory). 

Simplification of the language 

In proving soundness and completeness, we will have to say something about each 
connective and quantifier in the language. So if we have fewer connectives and 
quantifiers, we will not have to say as much. So now I will stipulate that the language of 
first-order logic contains only the following logical symbols: ¬, →, ∀, =. So we're 
dropping ∧, ∨, and ∃. 

To be precise, we now define the language L as follows: 

The vocabulary of L: 

Denumerably many individual constants: a, b, c, ... 
Denumerably many individual variables: x, y, z, ... 

(Recall from Lesson 1 that individual constants and individual variables are called terms.) 

For each n, denumerably many n-ary predicates: F, G, H, ... 

The identity symbol: = 

The following sentential connectives: ¬, → 

The universal quantifier: ∀ 

("Denumerably many" means: One for each of the natural numbers, 0, 1, 2, 3, ... .) 

The definitions of wff, bound variable, and sentence of L: 

A string S of symbols from the vocabulary of L is a well-formed formula (wff) of L if 
and only if: 

(a) S = Pt1t2...tn and P is an n-ary predicate of L and t1, t2, ..., tn are terms of L, or 

(b) S = ¬P and P is a well-formed formula of L, or 

(c) S = (P → Q) and P and Q are well-formed formulas of L, or 

(d) S = ∀vP and P is a well-formed formula of L and v is a variable of L (v is said to 
be bound in this case). 

Note: In this language we will not write parentheses after predicates. So in this 
language, an atomic sentence will be, for example, Hab, not H(a, b). There is a reason 
for this other than simplicity. Now when I do write something like P(v), that will stand 
for a formula that has a free variable v in it somewhere. If I write something like Pa/v, 
that will stand for the result of putting a in place of each free occurrence of v in P. For 
example, (Hax → ∀xGx)b/x is (Hab → ∀xGx). 

A sentence of L is a well-formed formula of L in which every variable is bound. 
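
The substitution notation Pa/v and the definition of sentence can both be made concrete. The Python sketch below is only an illustration under assumed conventions: formulas are encoded as tuples, the set VARIABLES stands in for the denumerable stock of variables, and the function names are hypothetical. It reproduces the example (Hax → ∀xGx)b/x from the note above.

```python
VARIABLES = {"x", "y", "z"}   # a small stand-in for the variables of the language

def substitute(formula, name, var):
    """Put name in place of each free occurrence of var in formula."""
    op = formula[0]
    if op == "pred":
        _, predicate, terms = formula
        return ("pred", predicate, tuple(name if t == var else t for t in terms))
    if op == "not":
        return ("not", substitute(formula[1], name, var))
    if op == "imp":
        return ("imp", substitute(formula[1], name, var),
                substitute(formula[2], name, var))
    if op == "all":
        _, bound, body = formula
        if bound == var:          # var is bound here, so nothing below is free
            return formula
        return ("all", bound, substitute(body, name, var))
    raise ValueError(f"unknown operator: {op!r}")

def free_variables(formula):
    """The set of variables with at least one free occurrence in formula."""
    op = formula[0]
    if op == "pred":
        return {t for t in formula[2] if t in VARIABLES}
    if op == "not":
        return free_variables(formula[1])
    if op == "imp":
        return free_variables(formula[1]) | free_variables(formula[2])
    if op == "all":
        return free_variables(formula[2]) - {formula[1]}
    raise ValueError(f"unknown operator: {op!r}")

def is_sentence(formula):
    """A sentence is a wff in which every variable is bound."""
    return not free_variables(formula)

hax = ("pred", "H", ("a", "x"))
all_x_gx = ("all", "x", ("pred", "G", ("x",)))
example = ("imp", hax, all_x_gx)          # (Hax -> AxGx), with x free in Hax
result = substitute(example, "b", "x")    # (Hab -> AxGx)
print(is_sentence(example), is_sentence(result))  # prints: False True
```

Only the free occurrence of x in Hax is replaced; the occurrences inside ∀xGx are bound and are left alone.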

Note: The elimination of ∧, ∨, and ∃ does not really reduce our "expressive power", 
because for every sentence in the old language containing one of these symbols there is a 
first-order equivalent sentence in that language that does not contain those symbols. That 
claim can be proved, by induction, but here I'll just state the general principles: 

Any sentence of the form ∃xP is FO-equivalent to ¬∀x¬P. 
Any sentence of the form (P ∨ Q) is FO-equivalent to (¬P → Q). 
Any sentence of the form (P ∧ Q) is FO-equivalent to ¬(P → ¬Q). 
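
The three principles just stated amount to a recursive translation from the old language into the reduced one. The following Python sketch shows one way such a translation might be written; the tuple encoding and the names translate, holds and equivalent are assumptions of the sketch. The equivalence check covers only quantifier-free sentences, by brute force over two-valued truth assignments.

```python
from itertools import product

def translate(s):
    """Rewrite a sentence so that it uses only negation, implication and 'all'."""
    if isinstance(s, str):                      # atomic: nothing to do
        return s
    op = s[0]
    if op == "not":
        return ("not", translate(s[1]))
    if op == "imp":
        return ("imp", translate(s[1]), translate(s[2]))
    if op == "or":                              # (P or Q)  becomes  (not P -> Q)
        return ("imp", ("not", translate(s[1])), translate(s[2]))
    if op == "and":                             # (P and Q) becomes  not (P -> not Q)
        return ("not", ("imp", translate(s[1]), ("not", translate(s[2]))))
    if op == "all":
        return ("all", s[1], translate(s[2]))
    if op == "ex":                              # ExP becomes not Ax not P
        return ("not", ("all", s[1], ("not", translate(s[2]))))
    raise ValueError(f"unknown operator: {op!r}")

def holds(s, valuation):
    """Two-valued truth of a quantifier-free sentence under a valuation."""
    if isinstance(s, str):
        return valuation[s]
    op = s[0]
    if op == "not":
        return not holds(s[1], valuation)
    if op == "and":
        return holds(s[1], valuation) and holds(s[2], valuation)
    if op == "or":
        return holds(s[1], valuation) or holds(s[2], valuation)
    if op == "imp":
        return (not holds(s[1], valuation)) or holds(s[2], valuation)
    raise ValueError(f"holds() does not handle: {op!r}")

def equivalent(s, t, atoms):
    """True if s and t agree under every truth assignment to atoms."""
    return all(holds(s, dict(zip(atoms, vals))) == holds(t, dict(zip(atoms, vals)))
               for vals in product([True, False], repeat=len(atoms)))

sample = ("and", ("or", "A", "B"), ("not", "C"))
print(translate(sample))
print(equivalent(sample, translate(sample), ("A", "B", "C")))  # prints: True
```

The recursion on the structure of the sentence mirrors the induction on complexity that a full proof of the claim would use.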

Exercise: Consider the language containing ∧ and ∃ in addition to the vocabulary of L. 
Show by induction on the complexity of sentences that for every sentence of that 
language there is an equivalent sentence that contains only the vocabulary of L. 

We can go on using ∧, ∨ and ∃, but we will think of sentences containing these symbols 
as merely abbreviations for sentences that do not contain them. We will also go on using 
⊥, but now we will think of this as some particular contradiction that we can write using 
just ¬ and →, such as ¬(Fa → Fa). 

Accordingly, we will not need the introduction and elimination rules for the connectives 
and quantifiers we have thrown away. If we throw away those introduction and 
elimination rules, will that mean that we cannot prove as many arguments? Yes and no. 
Yes, we cannot give proofs for arguments containing sentences containing symbols that 
are not in our language. But no, for any argument in the old language, if we could have 
proved it with the full set of Fitch rules, then we can prove its translation into the 
impoverished language using the impoverished set of rules. 

For example, ∧-Elim gives us the following one-step proof: 

(A ∧ B) 
------- 
B 

But using just the ¬, → and ⊥ rules, we can give the following proof of the "translation" 
of this argument. A and B might themselves be sentences containing ∧, ∨ and ∃; so let 
A' and B' be their translations, respectively. 

¬(A' → ¬B')            (premise) 
| ¬B'                  (assumption) 
| | A'                 (assumption) 
| | ¬B'                (Reit) 
| (A' → ¬B')           (→-Intro) 
| ⊥                    (⊥-Intro) 
¬¬B'                   (¬-Intro) 
B'                     (¬-Elim) 

Similarly, ∃-Elim permits proofs of the following form: 

∃vF 
| [a] Fa/v 
| ⋮ 
| Q 
Q 


On the assumption that Q can be derived from Fa/v, we can do the "same thing" without 
∃-Elim using our remaining rules. Suppose that F' is the translation of F and Q' is the 
translation of Q. We can assume that if Q can be derived from Fa/v, then Q' can be 
derived from F'a/v (because we're thinking of this as a step in an inductive proof on the 
complexity of sentences). 

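¬∀v¬F'                 (premise) 
| ¬Q'                  (assumption) 
| | [a]                (declaration) 
| | | F'a/v            (assumption) 
| | | ⋮ 
| | | Q' 
| | | ⊥                (⊥-Intro) 
| | ¬F'a/v             (¬-Intro) 
| ∀v¬F'                (∀-Intro) 
| ⊥                    (⊥-Intro) 
¬¬Q'                   (¬-Intro) 
Q'                     (¬-Elim) 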

Exercise: Show that we can similarly "prove" ∨-Elim using only the rules that are left 
after we throw away the rules for ∧, ∨ and ∃. 

Some Conventions and Definitions 

Notational conventions: Upper case sans-serif letters from before the middle of the 
alphabet used as predicates (F, G, etc.) will be predicates of the object language. Upper 
case sans-serif letters from after the middle of the alphabet used as predicates (P, Q, etc.) 
will be schematic predicate letters of the metalanguage. Single upper-case sans-serif 
letters from the beginning of the alphabet (A, B, etc.) will be abbreviations of sentences 
of the object language. Single upper-case sans-serif letters from after the middle of the 
alphabet (P, Q, etc.) will be schematic sentence letters of the metalanguage. 

Again, ⊥ is an abbreviation of a particular contradiction, let's say ¬(Fa → Fa). 

We will not say that a sentence "follows" from some other sentences, since that is 
ambiguous. We will say either that the sentence can be derived from those other 
sentences using our proof rules, or we will say that the sentence is a first-order 
consequence of those other sentences. 

Where A is a (possibly infinite) set of sentences of language L and P is a sentence of L, 
we say that A ⊢ P if and only if there is a proof in L of P from A. (Alternatively, we say 
P can be derived from A; and P is a syntactic consequence of A.) "⊢" denotes the 
syntactic consequence relation for L. The symbol "⊢" is called the single turnstile. 

Where A is a (possibly infinite) set of sentences of L, and P is a sentence of L, we say 
that A ⊨ P if and only if P is a first-order consequence in L of A. (Alternatively, we say 
the argument having the sentences in A as premises and P as conclusion is first-order 
valid; and P is a (first-order) semantic consequence of A.) "⊨" is the double turnstile. 

We will define proofs and subproofs as certain kinds of sequences whose members are 
sentences and other sequences. When I speak of a member of a sequence, such as a 
sentence or another sequence, I do not include sentences or sequences that are members 
of members. So the sequence ⟨A, ⟨B, C⟩⟩ has just two members, A and ⟨B, C⟩. B, for 
instance, is not a member of the sequence ⟨A, ⟨B, C⟩⟩. So also, in counting sentences in a 
sequence, we do not count the sentences in a subproof in the sequence. 

One sequence S' is a component of another sequence S if and only if either S' is identical 
to S or S' is a member of S or S' is a member of a component of S. So the components of 
⟨D, E, ⟨A, ⟨B, C⟩⟩⟩ are ⟨D, E, ⟨A, ⟨B, C⟩⟩⟩ itself, ⟨A, ⟨B, C⟩⟩ and ⟨B, C⟩. (B and C are 
not "components".) 
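
The difference between members and components is easy to see if sequences are modelled as nested Python tuples. This is only a sketch: modelling sentences as strings and sequences as tuples is an assumption made here for illustration, and the function name components is hypothetical.

```python
def components(sequence):
    """The sequence itself, its member sequences, and their components in turn."""
    found = [sequence]
    for member in sequence:
        if isinstance(member, tuple):      # only sequences can be components
            found.extend(components(member))
    return found

inner = ("A", ("B", "C"))
outer = ("D", "E", inner)

print(len(inner))            # 2 members: A and the sequence (B, C)
print("B" in inner)          # False: B is a member of a member, not a member
print(components(outer))     # the whole sequence, (A, (B, C)), and (B, C)
```

Python's own membership test on tuples agrees with the convention above: it looks only at top-level members.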



Proofs as Sequences 

We have been thinking of proofs as things that we write on the vertical dimension with 
"bars" marking subproofs. Now we are going to start thinking of proofs as sequences. 
The first member of a sequence that is a proof will be a set, the set of premises. 
Subsequent members will be sentences or other sequences. The sequences that are 
components of the sequences that are proofs will be subproofs. 

All of our proof rules can be reconceived as "permissions" on such sequences. For 
example: The rule →-Intro can be written: 

⟨..., ⟨P, ..., Q⟩, ..., (P → Q), ...⟩ 

In a sequence of this form, we will say that (P → Q) can be derived by →-Intro from 
⟨P, ..., Q⟩. 

The rule ∀-Intro can be written: 

⟨..., ⟨n, ..., Pn/v⟩, ..., ∀vP, ...⟩, where n does not occur at any higher point in the 
sequence and does not occur in P. 



Examples: 

⟨{(A → B), (B → C)}, ⟨A, B, C⟩, (A → C)⟩ is a proof. We will say that it has depth 1 
because the only sequence that it contains contains no further sequence. We will say that 
A and (A → B) are "higher" than B and that B can be derived from higher lines by 
→-Elim. Similarly, C can be derived from higher lines by →-Elim. (A → C) can be 
derived from ⟨A, B, C⟩ by →-Intro. 

⟨{(A → B)}, ⟨¬B, ⟨A, B, ⊥⟩, ¬A⟩, (¬B → ¬A)⟩ is a proof. It has depth 2 because it 
contains a sequence that contains a sequence (but that sequence contains no further 
sequence). The sentences that are higher than B are A, ¬B, and (A → B). However, A is 
not higher than ¬A and ¬A is not higher than (¬B → ¬A). 

⟨{∀xF(x), ∀x(F(x) → G(x))}, ⟨a, F(a), (F(a) → G(a)), G(a)⟩, ∀xG(x)⟩ is a proof of 
depth 1. 
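
The notion of depth used in these examples can be computed by a short recursion on nested sequences. The Python sketch below is illustrative only: it models a proof as a tuple whose first member is a frozenset of premises, models sentences and declarations as strings, and uses the hypothetical function name depth. It computes nesting depth and does not check that the rules have been applied correctly.

```python
def depth(sequence):
    """0 if the sequence contains no sequence; otherwise 1 + the deepest member."""
    subsequences = [m for m in sequence if isinstance(m, tuple)]
    if not subsequences:
        return 0
    return 1 + max(depth(s) for s in subsequences)

# The three example proofs from the text, with sentences written as strings.
proof1 = (frozenset({"(A → B)", "(B → C)"}), ("A", "B", "C"), "(A → C)")
proof2 = (frozenset({"(A → B)"}), ("¬B", ("A", "B", "⊥"), "¬A"), "(¬B → ¬A)")
proof3 = (frozenset({"∀xF(x)", "∀x(F(x) → G(x))"}),
          ("a", "F(a)", "(F(a) → G(a))", "G(a)"), "∀xG(x)")

print(depth(proof1), depth(proof2), depth(proof3))  # prints: 1 2 1
```

The set of premises is not a sequence, so it does not contribute to the depth.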

(Notice that I do not require that the sentence justified by the subproof immediately 
follow the subproof. Fitch does not require that either, although I never pointed that out.) 

The rules ¬-Elim, →-Elim, ⊥-Intro, ⊥-Elim, ∀-Elim, and Reit are our inference rules. 

The rules ¬-Intro, →-Intro, and ∀-Intro are our structural rules. (These are also called 
inference rules in a broad sense of the term.) 

The definition of proof 

So what is a proof? We still lack a precise definition of proofs as sequences. (I never did 
give you a precise definition of a proof when we were writing proofs vertically.) We 
define this concept inductively. Throughout I assume we are talking about the sentences 
of a particular language L, although I will usually suppress reference to it. 

A declaration: A name (treated as an assumption for ∀-Intro). 

A prima facie subproof of depth 0: A (finite) sequence whose first member is either a 
sentence (an assumption) or a declaration and whose subsequent members are all 
sentences. 

A prima facie subproof of depth m+1: A (finite) sequence whose last member is a 
sentence (the conclusion), whose first member is either a sentence (an assumption) or a 
declaration and whose subsequent members are all either sentences or prima facie 
subproofs of depth no greater than m, and which contains at least one prima facie 
subproof of depth m. 

A prima facie proof of depth 0: A (finite) sequence whose first member is a (possibly 
infinite) set of sentences (the premises) and whose subsequent members are all sentences. 

A prima facie proof of depth m+1: A finite sequence whose last member is a sentence 
(the conclusion), whose first member is a set of sentences (the premises) and whose 
subsequent members are all either sentences or prima facie subproofs of depth no greater 
than m, and which contains at least one prima facie subproof of depth m. 

An item is either a sentence or a prima facie subproof. 

If i is an item and Q is a sentence and i and Q are members of components of sequence S, 
then i is higher than Q in S if and only if either (i) i is a member of the set of premises 
and Q is not, or (ii) i and Q are members of the sequence S', which is a component of S, 
and i is earlier than Q in S', or (iii) i is a member of a sequence S', a component of S, and 
sequence S'' is a member of S', and i is earlier than S'' in S', and Q is a member of a 
component of S''. 

A subproof S' of depth 0 in prima facie subproof or proof S: A prima facie subproof of 
depth 0 that is a component (not necessarily a member) of a prima facie subproof or 
prima facie proof S of depth greater than 0 such that each member (sentence) of S' either 
(i) is an assumption or a declaration at the beginning of S' or (ii) can be derived by one of 
our inference rules from sentences that are higher in S than it. 

A subproof S' of depth m+1 in prima facie subproof or proof S: A prima facie subproof 
of depth m+1 that is a component (not necessarily a member) of a prima facie subproof or 
proof S of depth greater than m+1 such that each item in S' either (i) is an assumption or a 
declaration at the beginning of S' or (ii) can be derived by one of our inference rules or 
structural rules from items that are higher in S than it or (iii) is a subproof in S' of depth 
no greater than m in S, and there is at least one subproof in S' that has a depth of m. 

A proof of depth 0: A prima facie proof of depth 0 whose first member is a set of 
sentences (the premises) and whose subsequent members are all sentences that can be 
derived from earlier members by one of our inference rules. 

A proof S of depth m+1: A prima facie proof of depth m+1 whose members subsequent 
to the set of premises are all either (i) sentences that can be derived from earlier items by 
one of our rules, or (ii) subproofs in S of depth no greater than m, and there is at least one 
subproof in S that has a depth of m. 

A proof is a sequence that for some m ≥ 0 is a proof of depth m. 

Notice that every subproof is also a prima facie subproof, and every proof is also a prima 
facie proof. 

We will say that there is a proof of P from A if and only if there is a proof for which the 
premises are the sentences in A and the last item is the sentence P. 

Some preliminary lemmas: 

I make the following assumptions about first-order consequence (provable from the 
definition of first-order consequence): 

Semantic Weakening: If A ⊨ P and A ⊆ B, then B ⊨ P. 
Semantic Cut: If A ⊨ P and for all Q ∈ A, B ⊨ Q, then B ⊨ P. 

(Semantic Weakening is an immediate consequence of Semantic Cut. I will make 
frequent use of Weakening without mentioning it.) 

Lemma 1: (a) Every argument in which the conclusion can be derived from the premises 
by one of our inference rules is first-order valid, and (b) each of our structural rules is 
validity-preserving. 

Proof of (a): Exercise. Use the definition of first-order validity. 

Proof of (b): 

¬-Intro: The claim is that if A ∪ {Q} ⊨ ⊥, then A ⊨ ¬Q. Suppose A ⊭ ¬Q. Then 
there is a structure M such that every sentence in A is true in M and ¬Q is not true in 
M. So every sentence in A is true in M and Q is true in M. But ⊥ is not true in M. 
So A ∪ {Q} ⊭ ⊥. 

→-Intro: If A ∪ {P} ⊨ Q, then A ⊨ (P → Q). Exercise. 

∀-Intro: The claim is that if A ⊨ Pn/v, where n is a name that does not occur in any 
member of A and does not occur in P, then A ⊨ ∀vP. Suppose A ⊭ ∀vP. Then there 
is a structure M and an object o in the domain of the structure such that every 
sentence in A is true in the structure but g∅[v/o] does not satisfy P. Now consider a 
structure M' just like M except that in M', Σ(n) = o. A sentence that does not contain n is 
true in M' if and only if it is true in M. (Strictly speaking, we should prove that by 
induction on the complexity of sentences; but it's obvious.) Likewise, if n does not 
occur in P, then g∅ satisfies Pn/v in M' (Pn/v is true in M') if and only if g∅[v/o] 
satisfies P in M. So every sentence in A is true in M', but Pn/v is not true in M'. So 
A ⊭ Pn/v. 

Lemma 2: Let S' be a subproof of depth 0 in proof S. Each sentence after the 
assumption (if there is one) is a first-order consequence of the assumption together with 
sentences higher in the proof. 



L2: The Soundness Theorem 3/19/10 6:32 PM 



Page 40 



Proof: By induction. 

Basis: Show that the first sentence after the assumption has this property. The first 
sentence can be derived from the assumption and higher sentences by one of our 
inference rules. So the thesis holds by Lemma 1 (a). 

Induction Hypothesis: The first through nth sentences after the assumption of S' are 
first-order consequences of the assumption and sentences higher than S' in S. 

Induction Step: We have to show that the n+1st sentence after the assumption is a 
first-order consequence of the assumption and sentences higher than S' in S. This is 
an immediate consequence of the induction hypothesis and Semantic Cut. (In the 
statement of Semantic Cut, let B consist of the assumption and higher sentences, and 
let A consist of the assumption, higher sentences and the first through nth sentences 
after the assumption.) 

Lemma 3: Let S' be a subproof in proof S. Suppose that for every subproof S'' in S', 
every sentence in S'' after the assumption (if there is one) is a first-order consequence of 
the assumption of S'' and sentences in S higher than S'' (which includes sentences higher 
than S' in S). Then every sentence in S' after the assumption is a first-order consequence 
of the assumption of S' and sentences higher than S' in S. 

Basis: Show that the first sentence P in S' after the assumption or declaration is a 
first-order consequence of the assumption and sentences higher in S. We have two 
cases to consider: 

Case 1: P can be derived by an inference rule from the assumption of S' and 
sentences higher in S. Then the thesis is a consequence of Lemma 1 (a). 

Case 2: P can be derived by a structural rule from a subproof S'' in S'. (So the 
first item in S' after the assumption or declaration is not a sentence but the 
subproof S''.) Then by the hypothesis of the lemma and Lemma 1 (b), P is a 
first-order consequence of the assumption of S' and sentences higher than S' in S. 

(For example: Lemma 1 (b) tells us that if A ∪ {Q} ⊨ ⊥, then A ⊨ ¬Q. Think 
of A as comprising the assumption of S' and the sentences higher than S' in S, 
think of Q as the assumption of S'', think of ⊥ as the last item in S'', and think of 
¬Q as P.) 

Induction Hypothesis: Each of the first through nth sentences after the assumption or 
declaration in S' is a first-order consequence of the assumption of S' and higher items 
in S. 

Induction Step: Where P is the n+1st sentence after the assumption in S', we have to 
show that P is a first-order consequence of the assumption of S' and sentences higher 
than S' in S. 

Case 1: P can be derived by an inference rule from sentences earlier in S' and 
higher than S' in S. By Lemma 1 (a), P is a first-order consequence of sentences 
earlier in S' and higher than S' in S. So by the induction hypothesis and Cut, P is 
a first-order consequence of the assumption of S' and sentences higher than S' in 
S. 

Case 2: P can be derived by a structural rule from a subproof S'' in S'. By 
Lemma 1 (b), P is a first-order consequence of sentences higher than S'' in S. 
Then by the induction hypothesis and Cut, P is a first-order consequence of the 
assumption of S' and sentences higher than S' in S. 



Lemma 4: In any subproof S' in any proof S, every sentence in S' after the assumption 
has the property of being a first-order consequence of the assumption of S' and sentences 
higher than S' in S. 

Basis: Let S' be a subproof of depth 0 in proof S. By Lemma 2, S' has the property. 

Induction Hypothesis: Any subproof of depth no greater than m in S has the property. 

Induction Step: We need to show that any subproof of depth m+1 has the property. 
This is an immediate consequence of Lemma 3. 



The Soundness Theorem for First-order Logic: If A \- Q, then .4 |= Q. 

Proof: We prove something stronger: Each sentence in a proof is a first-order 
consequence of the premises. Again, we proceed by induction. (The proof is similar to 
the proof of Lemma 3.) 

Basis: Let S be a proof. Where P is the first sentence in 5* after the premises, show 
that P is a first-order consequence of the premises. Two cases: 
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Case 1: P can be derived from the premises by an inference rule. By Lemma 1 
(a), P is a first-order consequence of the premises. 

Case 2: P can be derived from a subproof by a structural rule. (So the first item 
after the premises is a subproof.) By Lemma 4, the last sentence of the subproof 
is a first-order consequence of the assumption and the premises (which are the 
only higher sentences). So by Lemma 1 (b), P is a first-order consequence of the 
premises. 

Induction Hypothesis: The first through nth lines after the premises are first-order 
consequences of the premises. 

Induction Step: Where P is the (n+1)st line, we have to show that P is a first-order
consequence of the premises.

Case 1: P can be derived from earlier sentences by an inference rule. By Lemma
1 (a), P is a first-order consequence of the earlier sentences. Then the thesis holds
by the induction hypothesis and Cut.

Case 2: P can be derived from a subproof S' in S by a structural rule. By Lemma
4, the last sentence in S' is a first-order consequence of the assumption of S' and
sentences higher than S' in S. So by Lemma 1 (b), P is a first-order consequence
of sentences higher than S' in S. So by the induction hypothesis and Cut, P is a
first-order consequence of the premises.



Lesson 3: The Completeness Theorem for Truth-functional 
Logic 



Recall the Soundness Theorem: If A ⊢ Q, then A ⊨ Q.

We now want to prove the Completeness Theorem (for first-order logic):
If A ⊨ Q, then A ⊢ Q.

However, we will approach it in stages. 

Define: A ⊢_tt Q if and only if Q can be derived from sentences in A using only the
introduction and elimination rules for ¬ and →, and ⊥-Intro. (In other words, there is a
proof, call it a tt-proof, of Q from A using just those rules.) Call these the sentential rules
(since they do not deal in subformulas). I assume that the relation ⊢_tt pertains to a
particular language, although I usually suppress reference to it.

We do not need ⊥-Elim, because if we can construct a proof like ⟨..., ⊥, Q⟩, then we can
construct a proof like ⟨..., ⟨¬Q, ⊥⟩, ¬¬Q, Q⟩. We also do not need Reit, because if
we can construct a proof like ⟨..., P, ..., P⟩, then we can construct a proof like
⟨..., P, ..., ⟨¬P, ⊥⟩, ¬¬P, P⟩.

Let val (lower case "v") be an assignment of the truth values T and F to the atomic and 
quantified sentences (i.e., noncompound sentences) of L. 

Let Val (capital "V") be a function from the sentences of L into {T, F} such that
Val(P) = T if and only if either:

(i) P is an atomic or quantified sentence and val(P) = T, or

(ii) P = ¬Q and Val(Q) = F, or

(iii) P = (Q → R) and either Val(Q) = F or Val(R) = T.

(Remember that we have thrown out ∧, ∨ and ↔.)

So Val, called an evaluation, "extends" to the rest of the language the truth value 
assignment val. 
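
The way Val extends val can be sketched in a few lines of Python. This is only an illustration under assumptions of my own: sentences are encoded as nested tuples, and atomic and quantified sentences are both represented as opaque units tagged "atom". None of these names come from the notes.

```python
# Hypothetical encoding (not from the notes):
#   ("atom", "Fa")   an atomic or quantified sentence, treated as a unit
#   ("not", Q)       the negation of Q
#   ("imp", Q, R)    the conditional (Q -> R)

def Val(sentence, val):
    """Extend the basic assignment val (dict: unit name -> bool) to any sentence."""
    kind = sentence[0]
    if kind == "atom":                    # clause (i)
        return val[sentence[1]]
    if kind == "not":                     # clause (ii)
        return not Val(sentence[1], val)
    if kind == "imp":                     # clause (iii)
        return (not Val(sentence[1], val)) or Val(sentence[2], val)
    raise ValueError(f"unknown sentence form: {kind!r}")

val = {"Fa": True, "AxGx": False}
P = ("imp", ("atom", "Fa"), ("not", ("atom", "AxGx")))
print(Val(P, val))  # True
```

Note that the quantified unit "AxGx" receives its value directly from val; Val never looks inside it.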

Define: A ⊨_tt Q if and only if for every truth value assignment val, if for all P ∈ A, Val(P)
= T, then Val(Q) = T. (In other words, Q is a tautological consequence of A. Again, I
assume we are talking about a particular language, although I suppress reference to it.)

Define: A is tt-satisfiable if and only if there is an assignment val of truth values to the
atomic and quantified components such that every member of A is true on that
assignment, i.e., for all S ∈ A, Val(S) = T. (A is truth-functionally consistent.)
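
For a finite set A, both definitions can be checked by brute force over truth value assignments. The sketch below assumes a tuple encoding of sentences and helper names (units, assignments) that are mine, not the notes'. It does not cover infinite sets A, which the definitions allow.

```python
from itertools import product

# Hypothetical encoding: ("atom", name), ("not", Q), ("imp", Q, R).

def Val(s, val):
    if s[0] == "atom":
        return val[s[1]]
    if s[0] == "not":
        return not Val(s[1], val)
    return (not Val(s[1], val)) or Val(s[2], val)      # "imp"

def units(s):
    """Names of the atomic and quantified units occurring in s."""
    if s[0] == "atom":
        return {s[1]}
    return set().union(*(units(part) for part in s[1:]))

def assignments(sentences):
    """Every truth value assignment to the units occurring in the sentences."""
    names = sorted(set().union(*(units(s) for s in sentences)))
    for row in product([True, False], repeat=len(names)):
        yield dict(zip(names, row))

def tt_satisfiable(A):
    return any(all(Val(s, v) for s in A) for v in assignments(A))

def tt_consequence(A, Q):
    return all(Val(Q, v)
               for v in assignments(list(A) + [Q])
               if all(Val(s, v) for s in A))

F, G = ("atom", "F"), ("atom", "G")
print(tt_consequence([F, ("imp", F, G)], G))   # True
print(tt_satisfiable([F, ("not", F)]))         # False
```

On finite sets, tt_consequence(A, Q) holds exactly when A together with ("not", Q) is not tt-satisfiable.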

Before we prove the Completeness Theorem, we will prove the completeness of the 
sentential rules relative to tautological consequence: 

The Truth-functional Completeness Theorem: If A ⊨_tt Q, then A ⊢_tt Q.

Here is how we will do that. We will prove the following three Lemmas, from which the 
Truth-functional Completeness Theorem follows: 

Lemma 1: If A ⊬_tt S, then A ∪ {¬S} ⊬_tt ⊥.

Lemma 2: If B ⊬_tt ⊥, then B is tt-satisfiable.

Lemma 3: If A ∪ {¬S} is tt-satisfiable, then A ⊭_tt S.

The Truth-functional Completeness Theorem immediately follows from Lemmas 1-3:

If A ⊬_tt S, then A ⊭_tt S,

i.e., if A ⊨_tt S, then A ⊢_tt S.
(Think of B in Lemma 2 as A ∪ {¬S}.) The hard part will be to prove Lemma 2.

Proof of Lemma 1:

We will prove it in this form: If A ∪ {¬S} ⊢_tt ⊥, then A ⊢_tt S.

Observe: If we can have a proof like this:

⟨A ∪ {¬S}, ..., ⊥⟩

then we can have a proof like this:

⟨A, ⟨¬S, ..., ⊥⟩⟩.

Add two steps, by ¬-Intro and ¬-Elim:

⟨A, ⟨¬S, ..., ⊥⟩, ¬¬S, S⟩.

Thus, we obtain a proof of S from A.

Proof of Lemma 3:

If there is a truth value assignment val such that for all P ∈ A, Val(P) = T and
Val(¬S) = T, then there is a truth value assignment, namely, the same one, such that
for all P ∈ A, Val(P) = T and Val(S) = F.

Lemma 2 is an immediate consequence of the following two theorems: 

Theorem: Satisfiability of formally consistent and complete sets:

Suppose that M is a set of sentences that is formally consistent and formally complete,
that is:

(i) M ⊬_tt ⊥ (formal consistency), and

(ii) For all sentences S, either M ⊢_tt S or M ⊢_tt ¬S (formal completeness).

Then there is a truth value assignment val such that for all S ∈ M, Val(S) = T.

Theorem: Completability of formally consistent sets:

Suppose B is formally consistent (i.e., B ⊬_tt ⊥). Then there is a formally consistent,
formally complete set M such that B ⊆ M.

Proof of Lemma 2, given these theorems: 

Suppose that B ⊬_tt ⊥. By the completability of formally consistent sets, there is a
formally consistent, formally complete set M such that B ⊆ M. By the satisfiability of
formally consistent and complete sets, M is tt-satisfiable. Since B ⊆ M, B is tt-satisfiable
too.

It remains to prove the satisfiability of formally consistent and complete sets and the
completability of formally consistent sets.

Lemma 4 (= Lemma 3 in Barwise and Etchemendy, p. 472):

Suppose A is a formally consistent and formally complete set of sentences.

1. A ⊢_tt ¬P if and only if A ⊬_tt P.

2. A ⊢_tt (P → Q) if and only if either A ⊬_tt P or A ⊢_tt Q.

(Note: You would expect a proof of the completeness of the sentential proof rules to use
those rules somehow. They are used here, in proving Lemma 4, as well as in the proof of
the completability of formally consistent sets.)

Proof of 1:

Left-to-right: By the assumption that A is formally consistent,
if A ⊢_tt ¬P, then A ⊬_tt P.

Right-to-left: By the assumption that A is formally complete,
if A ⊬_tt P, then A ⊢_tt ¬P.

Proof of 2:

Right-to-left:

Case (i): Suppose A ⊬_tt P. Since A is formally complete, A ⊢_tt ¬P. So we can
construct a proof like the following, using ⊥-Intro, ¬-Intro, ¬-Elim and →-Intro:
⟨A, ¬P, ⟨P, ⟨¬Q, ⊥⟩, ¬¬Q, Q⟩, (P → Q)⟩.

Case (ii): Suppose A ⊢_tt Q. Then we can use →-Intro to construct a proof like
this: ⟨A, ⟨P, ..., Q⟩, (P → Q)⟩.

Left-to-right (by contraposition):

Suppose A ⊢_tt P and A ⊬_tt Q. Since A is complete, A ⊢_tt ¬Q.

So we can construct the following proof, using →-Elim, ⊥-Intro, and ¬-Intro:

⟨A, ..., P, ¬Q, ⟨(P → Q), Q, ⊥⟩, ¬(P → Q)⟩.

Since A is formally consistent, A ⊬_tt (P → Q).



Proof of the satisfiability of formally consistent and formally complete sets:

Suppose M is formally consistent and complete. Let val be such that for all atomic
and quantified sentences S of L, val(S) = T if and only if M ⊢_tt S. We prove by
induction that for all sentences S of L, Val(S) = T if and only if M ⊢_tt S. In that case,
for all S ∈ M, Val(S) = T.

Basis: The thesis holds for all atomic and quantified sentences, by the definition of
val.

Induction hypothesis: Suppose the thesis holds for arbitrary sentences Q and R of L.

Induction step:

(¬) Suppose P = ¬Q. Val(P) = T iff Val(Q) = F iff (by IH) M ⊬_tt Q iff (by Lemma
4) M ⊢_tt ¬Q.

(→) Suppose P = (Q → R). Val(P) = T iff Val(Q) = F or Val(R) = T iff (by IH)
M ⊬_tt Q or M ⊢_tt R iff (by Lemma 4) M ⊢_tt (Q → R).

But for all S ∈ M, M ⊢_tt S. So for all S ∈ M, Val(S) = T, i.e., M is tt-satisfiable.

Enumerating the sentences of L.

We will need to assume that there is an infinite list of the sentences of L. Since there are 
infinitely many names, variables and predicates in the language, we cannot expect to list 
them in alphabetical order. Here is how we can do it. Suppose we have an infinite list of 
the sentences that are two symbols long (e.g., Fa, Fb, etc.), and an infinite list of the 
sentences that are three symbols long, and so on. So we have an infinite number of 
infinite lists. Arrange the lists in a table, with each list occupying one column, thus: 





         2 sym's    3 sym's    4 sym's    ...
1st
2nd
3rd
...

[Figure: a zig-zag line runs through the cells of the table.]

And then, produce a single list of all sentences by following the zig-zag line through the 
table. (This is called zig-zagging through the table.) 
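
Here is one way to make the zig-zag path concrete, as a sketch rather than the only possible route. It walks the table one anti-diagonal at a time, always in the same direction, which is a slight simplification of a back-and-forth zig-zag. The function name and the zero-based (row, column) indexing are my own choices.

```python
from itertools import count, islice

def zigzag():
    """Yield (row, column) pairs so that every cell of an infinite table is reached."""
    for d in count(0):                 # d picks out the d-th anti-diagonal
        for row in range(d + 1):
            yield row, d - row         # cells on this diagonal satisfy row + column = d

print(list(islice(zigzag(), 6)))
# [(0, 0), (0, 1), (1, 0), (0, 2), (1, 1), (2, 0)]
```

Each diagonal is finite, so any given cell is reached after finitely many steps, which is what the single list requires.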

But now, how do we produce the list of two-symbol sentences? Here's how: Construct a
table with a list of predicates running down the left and a list of names running across the
top, and in each cell write the predicate followed by the name, thus:

        a      b      c      ...
A       Aa     Ab     Ac
B       Ba     Bb     Bc
C       Ca     Cb     Cc
...

Finally, produce the list of two-symbol sentences by zig-zagging through the table. 

Exercise: Describe a method for listing the three-symbol sentences. (Hint: Think "three 
dimensions".) 

Note: When it comes to listing, say, the seven-symbol sentences, the easiest thing might
be just to describe a method for listing all strings of seven symbols of L and then say,
"Go through the list of seven-symbol strings and add to the list of seven-symbol
sentences each of those that happens to be a sentence of L."

One more note about constructing such lists: If we really want to imagine generating the 
list of sentences, we cannot suppose that we are "given" a bunch of tables each of which 
has infinitely long columns and infinitely long rows. Rather, we have to imagine we 
have a set of instructions that allows us to add a cell to each of the tables we are using as 
we need it. 
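
The idea of adding cells only as they are needed corresponds to lazy generation. The sketch below assumes a made-up naming scheme (predicates P0, P1, ... and names a0, a1, ...) purely for illustration. It builds each cell of the predicate-by-name table at the moment the enumeration reaches it, and position(i, j) computes where a given cell falls in the list.

```python
from itertools import count, islice

def predicate(i):
    return f"P{i}"       # hypothetical naming scheme for the i-th predicate

def name(j):
    return f"a{j}"       # hypothetical naming scheme for the j-th name

def two_symbol_sentences():
    """Enumerate predicate-name sentences diagonal by diagonal, building cells on demand."""
    for d in count(0):
        for i in range(d + 1):
            yield predicate(i) + name(d - i)

def position(i, j):
    """Zero-based index at which row i, column j appears in the enumeration."""
    d = i + j
    return d * (d + 1) // 2 + i

first = list(islice(two_symbol_sentences(), 10))
print(first[position(2, 1)])  # P2a1
```

No infinite table is ever stored; only the finitely many cells consumed so far have been constructed.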



Proof of completability of formally consistent sets.
Let B be formally consistent.

Let A_0, A_1, A_2, ... be an enumeration of all atomic and quantified sentences (which
can be produced in some such manner as that just described).

Define M thus:
Let B_0 = B.

For each i ≥ 0, let B_{i+1} = B_i ∪ {A_i} if B_i ∪ {A_i} ⊬_tt ⊥.
Otherwise, let B_{i+1} = B_i.

Let M = B_0 ∪ B_1 ∪ B_2 ∪ ... = ⋃_{i=0}^{∞} B_i. (Clearly, B ⊆ M.)

(In other words, S ∈ M if and only if S ∈ B_0 or S ∈ B_1 or S ∈ B_2 or ....)

M is formally consistent.
Suppose not. That is, M ⊢_tt ⊥.

Since proofs are finite, there is a smallest j such that B_j ⊢_tt ⊥.
But by the construction, there is no such j: B_0 = B is formally consistent, and each
B_{i+1} is formally consistent if B_i is.

(The fact that proofs are finite means that only finitely many members of M are used
in the proof. So we can pick the smallest j such that B_j includes them all.)
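
The stage-by-stage construction of M can be imitated on a finite fragment. The sketch below makes a substantial assumption, flagged here and in the code: it tests B_i ∪ {A_i} for tt-satisfiability instead of formal consistency. The two coincide only given the soundness and completeness results under discussion, so the code illustrates the shape of the construction and plays no part in the proof. The encoding and function names are hypothetical.

```python
from itertools import product

# Hypothetical encoding: ("atom", name), ("not", Q), ("imp", Q, R).

def Val(s, val):
    if s[0] == "atom":
        return val[s[1]]
    if s[0] == "not":
        return not Val(s[1], val)
    return (not Val(s[1], val)) or Val(s[2], val)

def units(s):
    if s[0] == "atom":
        return {s[1]}
    return set().union(*(units(part) for part in s[1:]))

def consistent(sentences):
    # ASSUMPTION: a semantic stand-in for "does not tt-prove a contradiction".
    names = sorted(set().union(*(units(s) for s in sentences)))
    return any(all(Val(s, dict(zip(names, row))) for s in sentences)
               for row in product([True, False], repeat=len(names)))

def complete(B, enumeration):
    """Run the stages B_0, B_1, ... over a finite enumeration A_0, A_1, ..."""
    stage = list(B)                       # B_0 = B
    for A_i in enumeration:
        if consistent(stage + [A_i]):     # add A_i only if consistency survives
            stage.append(A_i)
    return stage

F, G, H = ("atom", "F"), ("atom", "G"), ("atom", "H")
B = [("imp", F, G), ("not", G)]
print(complete(B, [F, G, H]))   # F and G are rejected, H is added
```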

M is formally complete.

That is to say, for all sentences S in L, M ⊢_tt S or M ⊢_tt ¬S.
We prove this by induction.

Basis:

Suppose A_j is an atomic or quantified sentence and j is its place in the enumeration.
Suppose M ⊬_tt A_j. In that case, by the construction of M, B_j ∪ {A_j} ⊢_tt ⊥.
So by ¬-Intro, B_j ⊢_tt ¬A_j. So M ⊢_tt ¬A_j.
So either M ⊢_tt A_j or M ⊢_tt ¬A_j.

Induction hypothesis: Suppose for arbitrary Q and R of L, M ⊢_tt Q or M ⊢_tt ¬Q, and
M ⊢_tt R or M ⊢_tt ¬R.

Induction step:

(¬) Suppose P = ¬Q.

By the induction hypothesis, M ⊢_tt Q or M ⊢_tt ¬Q.

Case (i): M ⊢_tt ¬Q. Then M ⊢_tt P.

Case (ii): M ⊢_tt Q. So we have a proof like ⟨M, ..., Q⟩.

Using ⊥-Intro, ¬-Intro and ¬-Elim, we can construct a proof like
⟨M, ⟨¬Q, ..., Q, ⊥⟩, ¬¬Q⟩.
So M ⊢_tt ¬¬Q, i.e., M ⊢_tt ¬P.

So in both cases, we have either M ⊢_tt P or M ⊢_tt ¬P.

(→) Suppose P = (Q → R).

Case (i): M ⊢_tt ¬Q. In that case, we can construct a proof like this:

⟨M, ¬Q, ⟨Q, ⟨¬R, ⊥⟩, ¬¬R, R⟩, (Q → R)⟩.

Case (ii): M ⊢_tt R. In that case, we can construct a proof like this:

⟨M, ⟨Q, R⟩, (Q → R)⟩.

Case (iii): By the induction hypothesis, if cases (i) and (ii) do not hold, then
M ⊢_tt Q and M ⊢_tt ¬R. In that case, we can construct a proof like this:

⟨M, ⟨(Q → R), Q, R, ¬R, ⊥⟩, ¬(Q → R)⟩.

So in all cases, we have either M ⊢_tt (Q → R) or M ⊢_tt ¬(Q → R).

This completes the proof of the Truth-functional Completeness Theorem. 



Lesson 4: The Completeness Theorem for First-order Logic 



Now we want to prove the completeness theorem for first-order logic:
If A ⊨ Q, then A ⊢ Q.

Recall that A ⊢ Q means that where A is a set of sentences in the language L and Q is a
sentence in the language L, there is a proof of Q from the sentences in A (using any of
our introduction and elimination rules). Here we are thinking of L as a specific
first-order language. When we need to be specific about the language, we will write A ⊢_L Q.

We have already proved that if A ⊨_tt Q, then A ⊢_tt Q. So what we want to do now is
"extend" that result from truth-functional validity to first-order validity.

Let L+ be a language just like L except that it contains denumerably many additional
individual constants beyond those that L contains. (We will also call individual constants
names or just constants.)

So A ⊢_L+ Q will mean that where A is a set of sentences in L+ and Q is a sentence in L+,
Q can be derived from sentences in A using our introduction and elimination rules (i.e.,
there is a proof in L+ using those rules).

Where P is a well-formed formula of a first-order language L, n is an individual
constant of L, and v is a variable of L, Pn/v, as before (in Lecture 2), is the result of
substituting n for v wherever v occurs free in P. For example, ∃xRxy a/y = ∃xRxa.
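
The substitution operation Pn/v is easy to get subtly wrong where v is bound, so a small sketch may help. The tuple encoding of formulas and the function name subst are assumptions of mine. Because n is a constant, there is no danger of variable capture, and the only special case is a quantifier that binds v itself.

```python
# Hypothetical encoding of formulas:
#   ("pred", "R", ("x", "y"))   atomic formula Rxy
#   ("not", Q), ("imp", Q, R)   connectives
#   ("all", "x", Q), ("exists", "x", Q)   quantifiers

def subst(P, n, v):
    """Return Pn/v: P with constant n put for every free occurrence of variable v."""
    kind = P[0]
    if kind == "pred":
        return ("pred", P[1], tuple(n if t == v else t for t in P[2]))
    if kind == "not":
        return ("not", subst(P[1], n, v))
    if kind == "imp":
        return ("imp", subst(P[1], n, v), subst(P[2], n, v))
    if kind in ("all", "exists"):
        if P[1] == v:          # v is bound here, so no occurrence below is free
            return P
        return (kind, P[1], subst(P[2], n, v))
    raise ValueError(f"unknown formula form: {kind!r}")

P = ("exists", "x", ("pred", "R", ("x", "y")))
print(subst(P, "a", "y"))   # ('exists', 'x', ('pred', 'R', ('x', 'a')))
print(subst(P, "a", "x") == P)   # True: x is bound, nothing changes
```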

Outline of proof: 

We will show how to construct a set of sentences H (called the Henkin theory) in the
language L+ that meets the following conditions:

(i) The Elimination Theorem:

Where A is a set of sentences of L (the original language) and Q is a sentence of L,
if A ∪ H ⊢_L+ Q, then A ⊢ Q (i.e., A ⊢_L Q).

(In other words, everything that we can prove (in L) with the help of H (in L+), we
can prove without it. Only the members of H contain the extra constants of L+.)

(ii) The Henkin Construction Theorem:

For every truth value assignment val, if for all P ∈ H, Val(P) = T, then there is a
structure 𝔐_val such that for all P of L+, if Val(P) = T, then P is true in 𝔐_val.

(In other words, if the evaluation Val assigns truth to every member of H,
then there is a first-order structure that assigns truth to every sentence that Val
assigns truth to. I write "𝔐_val" as a mnemonic, to help you remember that it's the
structure that corresponds to val.)

Suppose we can establish the existence of such an H. The completeness theorem can 
then be proved as follows: 

Proof of completeness given a Henkin theory:

Suppose A ⊨ Q, where A is a set of sentences of L and Q is a sentence of L.

Then there is no structure 𝔐_val such that every member of A ∪ {¬Q} is true in 𝔐_val.

Then by the Henkin Construction Theorem, there is no truth value assignment val such
that for every P ∈ A ∪ H ∪ {¬Q}, Val(P) = T.

So A ∪ H ⊨_tt Q.

By the Truth-functional Completeness Theorem (applied to the language L+),
A ∪ H ⊢_{L+,tt} Q.

So (since every proof in L+ using just the sentential rules is also a proof in L+),
A ∪ H ⊢_L+ Q.

So by the Elimination Theorem,
A ⊢ Q.

Some useful propositions: 

(Assume that ⊢ is the syntactic consequence relation for an arbitrary first-order
language.)

Proposition 1 (the Deduction Theorem):
If A ∪ {P} ⊢ Q, then A ⊢ (P → Q).

Proof: Exercise.

Proposition 2 (Syntactic Cut):

If (i) A ∪ {P_1, P_2, ..., P_n} ⊢ Q and (ii) for all i, 1 ≤ i ≤ n, A ⊢ P_i, then A ⊢ Q.
(Compare the lemma called "Semantic Cut" in Lesson 2.)

Proof:

Suppose we have a proof like this:

⟨A ∪ {P_1, P_2, ..., P_n}, ..., Q⟩.

By Proposition 1, applied n times, we have a proof like this:

⟨A, ..., (P_1 → (P_2 → ( ... (P_n → Q)...)))⟩.

By (ii), we have a proof like this:

⟨A, ..., (P_1 → (P_2 → ( ... (P_n → Q)...))), ..., P_1, ..., P_2, ..., P_n⟩.

So by n applications of →-Elim, we have a proof like this:

⟨A, ..., Q⟩.

Proposition 3 (Lemma 7 in Barwise and Etchemendy, p. 536):

(1) If A ⊢ (P → Q) and A ⊢ (¬P → Q), then A ⊢ Q.

(2) If A ⊢ ((P → Q) → R), then A ⊢ (¬P → R) and A ⊢ (Q → R).

Proof:

Part (1): We know A ∪ {(P → Q), (¬P → Q)} ⊢ Q. So (1) follows by Prop. 2.
Part (2): Exercise.

Proposition 4 (comparable to Lemma 8 in Barwise and Etchemendy, p. 536):
Suppose n does not occur in P or any member of A.
Then if A ⊢ (¬Pn/v → Q), then A ⊢ (¬∀vP → Q).

Proof: Note: This is where we use ∀-Intro.

Suppose we have a proof like this:

⟨A, ..., (¬Pn/v → Q)⟩.

Then we can construct a proof like this:

⟨A, ⟨¬∀vP, ⟨¬Q, ⟨n, ⟨¬Pn/v, (¬Pn/v → Q), Q, ⊥⟩, ¬¬Pn/v, Pn/v⟩, ∀vP, ⊥⟩,
¬¬Q, Q⟩, (¬∀vP → Q)⟩.

Proposition 5 (compare Lemma 9 in Barwise and Etchemendy, p. 537):
Suppose n does not occur in P or any member of A.
Then if A ∪ {(Pn/v → ∀vP)} ⊢ Q, then A ⊢ Q.

Proof:

Suppose A ∪ {(Pn/v → ∀vP)} ⊢ Q.

By Proposition 1, A ⊢ ((Pn/v → ∀vP) → Q).

By Proposition 3, part (2),

(i) A ⊢ (¬Pn/v → Q), and

(ii) A ⊢ (∀vP → Q).

From (i), by Proposition 4,

(iii) A ⊢ (¬∀vP → Q).

From (ii) and (iii), by Proposition 3, part (1),

(iv) A ⊢ Q.

Proposition 6 (compare Lemma 10 in Barwise and Etchemendy, p. 537):

A ⊢ (∀vP → Pn/v), and

A ⊢ ((Pn/v ∧ n = m) → Pm/v), and

A ⊢ n = n.

Proof: Exercise. This is where we use ∀-Elim and =-Elim and =-Intro.

Now we need to construct the Henkin theory. But first we need to construct the language
of the Henkin theory, L+. We will do this in stages. At each stage, we construct not only
a new language, but also a witness function for that language.

Suppose we have a table of individual constants not in L:

n_00    n_10    n_20    n_30    ...
n_01    n_11    n_21    n_31    ...
n_02    n_12    n_22    n_32    ...
n_03    n_13    n_23    n_33    ...
...

In other words, each of the extra constants of L+ should have a place in this table so that
we can identify it by its "double subscript". We will think of a language now as a set that
includes all of the basic vocabulary and all of the wffs that can be built up from that
vocabulary in accordance with the definition of a wff.

We further suppose that for any language we can produce an enumeration (a one- 
dimensional list) of all the formulas of that language containing exactly one free variable. 

We now define a whole series of languages starting with L and culminating in L+.

Let L_0 = L.

Let w_0 be a function such that if P_0j is the jth formula in an enumeration of the formulas
of L_0 having exactly one free variable, then w_0(P_0j) = n_0j.

For each i ≥ 0, let

L_{i+1} = L_i ∪ {w_i(P) | P is a wff of L_i with exactly one free variable}.

(Notation: In general, {f(x) | ... x ...} stands for the set of things that results from
applying function f to the things that satisfy the condition ... x ....)

For each i ≥ 0, let

w_{i+1}(P) = w_i(P)       if P is a wff of L_i,
w_{i+1}(P) = n_{(i+1)j}   if P is the jth wff in an enumeration of the wffs having exactly
                          one free variable that are in L_{i+1} but not in L_i.


So the construction alternates between constructing the next language and constructing 
the next function. At each stage the sentences of the language constructed at that stage 
will contain all wffs that can be grammatically constructed using the witnesses for the 
formulas of every previous stage. 

Let L+ = ⋃_{i=0}^{∞} L_i. (In other words, the union of L_0, L_1, L_2, ....)

Let w = ⋃_{i=0}^{∞} w_i.



(In defining a function through unions, we are thinking of functions as sets of ordered 
pairs.) 

The situation can be illustrated in a diagram: 

L_2     L_1     L_0    |  dob1    dob2    dob3    dob4    ...
P_20    P_10    P_00   |  n_00    n_10    n_20    n_30
P_21    P_11    P_01   |  n_01    n_11    n_21    n_31
P_22    P_12    P_02   |  n_02    n_12    n_22    n_32
P_23    P_13    P_03   |  n_03    n_13    n_23    n_33
...

[Figure: arrows labeled with the witness functions (w_2 and the others) link each column
of formulas to the column of constants that serve as their witnesses.]

The dob1 column: Add this column to L_0 to form L_1.
The L_1 column: An enumeration of the formulas of L_1 not also in L_0.
The L_2 column: An enumeration of the formulas of L_2 not also in L_1.

("dob" stands for date of birth)

If w(P) = n, write c_P for n. So w(P) = c_P. c_P is a witness for P. Notice that by our
construction, we have ensured that for every formula of L+ that has one free variable,
that formula has such a witness.

We can define the Henkin theory H as follows:

A wff Q of L+ is a member of H if and only if either:

(1) for some individual constant n, Q is n = n, or

(2) P is a wff of L+ containing at most v free, and either

(a) for some constants m and n, Q = ((Pn/v ∧ n = m) → Pm/v), or

(b) for some constant n, Q = (∀vP → Pn/v), or

(c) Q = (Pc_P/v → ∀vP).

Notice the use of "c_P" in this definition. In the case where P does not contain v free,
assume that Pc_P/v = P.

In the case where v is free in P, say that (Pc_P/v → ∀vP) is the witnessing axiom for c_P.
(So where P does not contain v free, the witnessing axiom for P does not contain a
witness.)

If n is a constant in L_0 (= L), then the birth date of n is 0.

If P is a formula in L_0 and w_0(P) = n, say that the birth date of n is 1.

For i > 0, if P is a formula in L_i but not in L_{i-1}, and w_i(P) = n, say that i + 1 is the birth
date of n.

dob(n) = i if and only if the birth date of n is i.

In other words, the date of birth (dob) of a name is i iff L_i is the first language in the
series containing formulas containing that name.

Observation:

If for some i ≥ 0, dob(n) = i + 1, then n does not occur in any wff in any of L_0, L_1, L_2,
..., L_i.

The Independence Lemma:

If c_P is not c_Q and dob(c_P) ≤ dob(c_Q), then c_Q is not in the witnessing axiom for c_P.

Proof:

Case 1: dob(c_P) < dob(c_Q) = i + 1. In this case, the witnessing axiom for c_P belongs
to a language earlier than L_{i+1}. So by the Observation above, c_Q is not in it.

Case 2: dob(c_P) = dob(c_Q) = i + 1. In that case w_i(P) = c_P and w_i(Q) = c_Q. So P and
Q belong to L_i. So c_Q is not in P and c_P is not in Q. So c_Q is not in (Pc_P/v → ∀vP)
and c_P is not in (Qc_Q/v → ∀vQ).

(We ignore the case in which dob(c_P) = dob(c_Q) = 0, because no witness has a birth
date of 0.)

Finally, we are in a position to prove the Elimination Theorem.

The Elimination Theorem:

Where A is a set of sentences of L (the original language) and Q is a sentence of L,
if A ∪ H ⊢_L+ Q, then A ⊢ Q.

Proof: By induction on the maximum number k of members of H used (cited) in the
proof of Q from A ∪ H.

Basis: k = 0. Trivial.

Induction hypothesis: Suppose that the thesis holds when at most k members of H are
used in the derivation of Q from A ∪ H.

Induction step: Show that the thesis holds when k + 1 members of H are used in the
derivation. Let U be the set of members of H that are used.

Case 1: At least one member of U is not a witnessing axiom. (It is a sentence of one
of the following forms: (∀vP → Pn/v), ((Pn/v ∧ n = m) → Pm/v), n = n.) By
Proposition 6, we can prove that member from A and the remainder of U. This brings
the number of members used down to k; so by the induction hypothesis, the thesis
holds.

Case 2: All of the members of U are witnessing axioms, i.e., of the form:
(Pc_P/v → ∀vP).

If for some sentence of this form in U, Pc_P/v = P (i.e., v is not free in P), then, by
Proposition 5, we can prove that member from A and the remainder of U. This brings
the number of members used down to k; so by the induction hypothesis, the thesis
holds.

Otherwise, each member of U contains a witness for some formula. Since U is finite,
we can find a witness in a sentence in U that has a latest birth date (i.e., a birth date
such that for every other witness in a sentence in U, its birth date is no later). Call
this witness c_P*. So there is some formula P* such that w(P*) = c_P*, and U contains

(P*c_P*/v → ∀vP*).

Since c_P* is not in L, c_P* occurs in no member of A and does not occur in Q. By the
Independence Lemma, c_P* occurs in no other member of U either.

Let U* be the set containing every member of U other than (P*c_P*/v → ∀vP*). By
Proposition 5, there is a proof of Q from A ∪ U*. That proof may contain c_P* and
other names not in L; however, given that proof we can certainly find another one
containing exclusively names in L. The number of members of H that are used in this
proof will be no greater than k. So by the induction hypothesis, the thesis holds.

End of proof. 

All that remains is to prove the Henkin Construction Theorem. 

Lemma: The Equivalence of Identicals:

If val is a truth value assignment for L+ such that for all P ∈ H, Val(P) = T (val satisfies
H), then for all constants n, m, and o of L+,

(i) val(n = n) = T, and

(ii) if val(n = m) = T then val(m = n) = T, and

(iii) if val(n = m) = T and val(m = o) = T, then val(n = o) = T.

(In other words, the relation between constants flanking a true identity is an
equivalence relation.)

Proof: Exercise. (Hint: Look at the sentences that have to be in H by the definition of
that set.)

Define [n] = {m | m is a constant of L+ and val(n = m) = T}. (Call this the equivalence
class for n relative to val.)
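
On a finite toy vocabulary, the equivalence classes can be computed directly from the definition. In this sketch the set true_identities stands in for the identity sentences to which val assigns T. That representation, the three constants, and the function name eq_class are my own assumptions for illustration.

```python
def eq_class(n, constants, true_identities):
    """[n]: the constants m such that val(n = m) = T."""
    return frozenset(m for m in constants if (n, m) in true_identities)

constants = ["a", "b", "c"]
# Pairs (n, m) for which val(n = m) = T; chosen to be reflexive and symmetric,
# as the Equivalence of Identicals requires of any val that satisfies H.
true_identities = {("a", "a"), ("b", "b"), ("c", "c"), ("a", "b"), ("b", "a")}

classes = {n: eq_class(n, constants, true_identities) for n in constants}
print(classes["a"] == classes["b"])   # True
print(len(set(classes.values())))     # 2
```

Constants flanking a true identity end up in the same class, so the three constants yield only two distinct classes.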

Proposition 7:

If for all P ∈ H, Val(P) = T, then {⟨[n], [m]⟩ | val(n = m) = T} is an identity relation.
(I.e., if val(n = m) = T, then [n] = [m].)

Proof: Suppose not. Then there are n and m such that val(n = m) = T, but [n] ≠ [m].

Case 1: There is a constant o such that o ∈ [n] but o ∉ [m]. In that case, val(n = o)
= T, but val(m = o) ≠ T. By Equivalence of Identicals (ii), val(m = n) = T. So by
Equivalence of Identicals (iii), val(m = o) = T. Contradiction.

Case 2: There is a constant o such that o ∈ [m], but o ∉ [n]. Similarly.

Proposition 8:

If for all P ∈ H, Val(P) = T and [n] = [o], then val(n = o) = T.

Proof: Suppose for all P ∈ H, Val(P) = T and [n] = [o]. So suppose, for a reductio,
that val(n = o) ≠ T. In that case, o ∉ [n]. But by the Equivalence of Identicals (i),
o ∈ [o]. So, contrary to assumption, [n] ≠ [o].

Proposition 9: If for all P ∈ H, Val(P) = T and [n] = [o] and Val(Pn/v) = T, then
Val(Po/v) = T. Proof: Exercise. (Hints: Think about the definitions of [n] and H.)

The Satisfaction Lemma:

For every structure 𝔐, every variable assignment g, and every formula Q, g[v/Σ(n)]
satisfies Q in 𝔐 if and only if g satisfies Qn/v in 𝔐. (g[v/Σ(n)] assigns to v the object
that Σ assigns to n.)

Proof: By induction on the complexity of wffs.

Basis: Suppose Q = Rt_1t_2...t_m.

Case 1: For every i, 1 ≤ i ≤ m, t_i ≠ v. So Qn/v = Q.

⟨h[v/Σ(n)](t_1), h[v/Σ(n)](t_2), ..., h[v/Σ(n)](t_m)⟩ = ⟨h(t_1), h(t_2), ..., h(t_m)⟩.

Case 2: For some i, 1 ≤ i ≤ m, t_i = v. So Qn/v = Rt_1t_2...n...t_m.

⟨h[v/Σ(n)](t_1), ..., h[v/Σ(n)](v), ..., h[v/Σ(n)](t_m)⟩ = ⟨h(t_1), ..., Σ(n), ..., h(t_m)⟩
= ⟨h(t_1), ..., h(n), ..., h(t_m)⟩.

Induction hypothesis: Suppose the thesis holds for Q and R.

Induction step:

(¬): Suppose P = ¬Q. Since, by the induction hypothesis, g[v/Σ(n)] satisfies Q in
𝔐 if and only if g satisfies Qn/v in 𝔐, g[v/Σ(n)] satisfies P in 𝔐 if and only if
g satisfies Pn/v in 𝔐.

(∨): Similarly.

(∀): Suppose P = ∀uQ.

Case (i): u ≠ v. By the induction hypothesis, for all o ∈ D_𝔐, g[v/Σ(n)][u/o]
satisfies Q if and only if g[u/o] satisfies Qn/v. So g[v/Σ(n)] satisfies ∀uQ
if and only if g satisfies ∀u[Qn/v] = [∀uQ]n/v. (This identity holds
because u ≠ v.) So g[v/Σ(n)] satisfies P if and only if g satisfies Pn/v.

Case (ii): u = v. So g[v/Σ(n)][u/o] = g[u/o]. So, trivially, for all o ∈ D_𝔐,
g[v/Σ(n)][u/o] satisfies Q if and only if g[u/o] satisfies Q. So g[v/Σ(n)]
satisfies ∀uQ if and only if g satisfies ∀uQ. But also P = ∀uQ = (since
v = u is bound) [∀uQ]n/v = Pn/v. So g[v/Σ(n)] satisfies P if and only if g
satisfies Pn/v.

Next, we will do an inductive proof that uses the concept of the complexity of a
sentence. So first, we need to extend the definition of complexity from Lesson 2 to cover
the case of quantified sentences (and swap → for ∨). Thus:

Atomic sentences have complexity 0.

If a sentence P has complexity k, then ¬P has complexity k+1.

If out of the two sentences P and Q, the complexity of the one with the greatest
complexity is k, then the complexity of (P → Q) is k+1.

If Qn/v has complexity k, then ∀vQ has complexity k+1.
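
The four clauses translate directly into a recursive function. The sketch assumes a tuple encoding of formulas that is mine, not the notes'. It also relies on the observation that putting a constant for a variable never changes the shape of a formula, so the complexity of Qn/v can be read off the body Q itself.

```python
# Hypothetical encoding: ("pred", "F", ("x",)), ("not", Q), ("imp", Q, R), ("all", "x", Q).

def complexity(P):
    kind = P[0]
    if kind == "pred":                       # atomic: complexity 0
        return 0
    if kind == "not":
        return complexity(P[1]) + 1
    if kind == "imp":                        # one more than the more complex side
        return max(complexity(P[1]), complexity(P[2])) + 1
    if kind == "all":                        # body has the same complexity as Qn/v
        return complexity(P[2]) + 1
    raise ValueError(f"unknown formula form: {kind!r}")

Fx = ("pred", "F", ("x",))
Gx = ("pred", "G", ("x",))
print(complexity(("all", "x", ("imp", Fx, ("not", Gx)))))   # 3
```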

The Henkin Construction Theorem:

For every truth value assignment val, if for all P ∈ H, Val(P) = T (i.e., val satisfies H),
then there is a structure 𝔐_val such that for all P of L+, if Val(P) = T, then P is true in
𝔐_val.

Proof: Suppose that val is a truth value assignment that satisfies H. We show how to
construct 𝔐_val and then we prove something stronger, viz., for every sentence P of
L+, Val(P) = T if and only if P is true in 𝔐_val. We prove the latter by induction on
the complexity of sentences.

First part: Construction of 𝔐_val.
𝔐_val = ⟨D, Σ⟩.

D = {[n] | n is a constant of L+}. ([n] is defined in terms of the given val.)
For every constant n of L+, Σ(n) = [n].
For every m-place predicate R of L+,

Σ(R) = {⟨[n_1], [n_2], ..., [n_m]⟩ | val(Rn_1n_2...n_m) = T}.

Nota bene: We are interpreting our language in a domain of objects that are
equivalence classes of names of that same language!

We have to check to make sure that 𝔐_val so defined really is a structure for L+. The
domain of 𝔐_val is nonempty and, by Proposition 7, Σ(=) is the identity relation on the
domain of 𝔐_val. So yes, it is.
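
To see the construction of the domain and the interpretation function at work, here is a toy version over three constants. Everything specific in it is assumed for illustration: the representation of val by two sets (true identities and true atomic sentences), the names build_structure and atomic_true, and the sample data. The data were chosen by hand so that co-referring constants satisfy the same predicates, as any val satisfying H must arrange.

```python
def build_structure(constants, true_identities, true_atoms):
    """Return (D, sigma_names, sigma_preds) following the three clauses of the construction."""
    cls = {n: frozenset(m for m in constants if (n, m) in true_identities)
           for n in constants}
    D = set(cls.values())                          # domain: the classes [n]
    sigma_names = dict(cls)                        # each constant n denotes [n]
    sigma_preds = {}
    for R, args in true_atoms:                     # the extension of R collects class tuples
        sigma_preds.setdefault(R, set()).add(tuple(cls[a] for a in args))
    return D, sigma_names, sigma_preds

def atomic_true(R, args, sigma_names, sigma_preds):
    return tuple(sigma_names[a] for a in args) in sigma_preds.get(R, set())

constants = ["a", "b", "c"]
true_identities = {("a", "a"), ("b", "b"), ("c", "c"), ("a", "b"), ("b", "a")}
true_atoms = {("F", ("a",)), ("F", ("b",)), ("R", ("a", "c")), ("R", ("b", "c"))}

D, names, preds = build_structure(constants, true_identities, true_atoms)
print(len(D))                                       # 2
print(atomic_true("R", ("b", "c"), names, preds))   # True
print(atomic_true("F", ("c",), names, preds))       # False
```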

Second part: The induction. We prove that Val(P) = T if and only if P is true in 𝔐_val.

Basis: Suppose P is atomic; i.e., P = Rn_1n_2...n_m.

Left-to-right: Suppose Val(P) = val(Rn_1n_2...n_m) = T. By the construction of 𝔐_val,
⟨[n_1], [n_2], ..., [n_m]⟩ ∈ Σ(R), and so ⟨Σ(n_1), Σ(n_2), ..., Σ(n_m)⟩ ∈ Σ(R). So Rn_1n_2...n_m
= P is true in 𝔐_val.

Right-to-left: Suppose P = Rn_1n_2...n_m is true in 𝔐_val. By the definition of Σ,
⟨Σ(n_1), Σ(n_2), ..., Σ(n_m)⟩ = ⟨[n_1], [n_2], ..., [n_m]⟩ ∈ Σ(R). So by the definition of Σ(R),
there are names o_1, o_2, ..., o_m such that [o_1] = [n_1], [o_2] = [n_2], ..., [o_m] = [n_m], and
val(Ro_1o_2...o_m) = T. But in that case, by Proposition 9, val(Rn_1n_2...n_m) = Val(P) = T.

Induction hypothesis: Suppose that the thesis holds for sentences having complexity
less than or equal to k.

Induction step: Show that the thesis holds for sentences having complexity k + 1.

Case ¬: P = ¬Q. Exercise.

Case →: P = (Q → R). Val((Q → R)) = T if and only if Val(Q) = F or Val(R) = T,
which (by the induction hypothesis) is so if and only if Q is false in 𝔐_val or R is true
in 𝔐_val, which is so if and only if (Q → R) is true in 𝔐_val.

Case ∀: P = ∀vQ. We need to show that Val(∀vQ) = T if and only if ∀vQ is true in
𝔐_val.

Left-to-right: Suppose Val(∀vQ) = T.

Since for all constants n in L+, (∀vQ → Qn/v) ∈ H,

for all n in L+, Val((∀vQ → Qn/v)) = T.

So by the definition of Val, for all n in L+, Val(Qn/v) = T.

By the induction hypothesis, for all n in L+, Qn/v is true in 𝔐_val.

So for all n in L+, g_∅ satisfies Qn/v.

So, by the Satisfaction Lemma, for all n in L+, g_∅[v/Σ(n)] satisfies Q.

By the construction of 𝔐_val, for every o ∈ D, there is an n in L+ such that Σ(n) = o.

So for all o ∈ D, g_∅[v/o] satisfies Q.

So g_∅ satisfies ∀vQ.
So ∀vQ is true in 𝔐_val.

Right-to-left: Suppose ∀vQ is true in 𝔐_val.
For every o ∈ D, g_∅[v/o] satisfies Q.

By the construction of 𝔐_val, for every constant n in L+, there is an object o ∈ D
such that Σ(n) = o.

So for all n in L+, g_∅[v/Σ(n)] satisfies Q.

So, by the Satisfaction Lemma, for all n in L+, g_∅ satisfies Qn/v.

In particular, g_∅ satisfies Qc_Q/v (c_Q being the witness for Q. Recall that in case v
is not free in Q, Qc_Q/v is Q).

So Qc_Q/v is true in 𝔐_val.

By the induction hypothesis, Val(Qc_Q/v) = T.

But since (Qc_Q/v → ∀vQ) ∈ H, Val((Qc_Q/v → ∀vQ)) = T.

So, by the definition of Val, Val(∀vQ) = T.

End of proof. 

This completes the proof of the Completeness Theorem for first-order logic. 

Exercise: Where A is a set of sentences of ℒ and Q is a sentence of ℒ, and H is the
Henkin Theory for ℒ (in ℒ+), show that A ⊢ Q if and only if A ∪ H ⊢_tt Q.
Hint: No inductions are called for. Assume the Soundness Theorem, as well as the
Truth-functional Completeness Theorem. Use the Elimination Theorem and the stronger
biconditional that we proved in order to prove the Henkin Construction Theorem (val(P) =
T if and only if P is true in 𝔐_val).

The Compactness Theorem: 

Say that a set A of sentences of ℒ is first-order satisfiable (or first-order consistent, or
just satisfiable) if and only if there is a first-order structure 𝔐 such that every sentence in
A is true in 𝔐 (in which case 𝔐 first-order satisfies A).

Here's the theorem: Suppose that A is a set of sentences of ℒ such that for all sets of
sentences B, if B is finite and B ⊆ A, then B is first-order satisfiable. Then A is first-order
satisfiable as well.



Proof: Assume the hypothesis, and suppose, for a reductio, that A is not satisfiable.
Then A ⊨ ⊥.

By completeness, A ⊢ ⊥.

But proofs are finite. So there is a finite set B ⊆ A such that B ⊢ ⊥.
By soundness, B ⊨ ⊥.

So B is not satisfiable, contrary to the hypothesis.
So A is satisfiable.

Alternative formulation: Suppose that A is a set of sentences of ℒ and Q is a sentence of
ℒ, and A ⊨ Q. Then there is a finite subset B of A such that B ⊨ Q.

Exercise 1: Prove the Compactness Theorem in its alternative formulation.

Exercise 2: Prove that this alternative formulation is equivalent to the first formulation.
(Using the alternative formulation, prove the Compactness Theorem, and using the
Compactness Theorem prove the alternative formulation. You do not need to use any of
our "big" theorems; just use definitions. Prove each in "contrapositive form".)

This Compactness Theorem may not seem like a very exciting result, but it will be 
exciting to discover (as we will in Lesson 13) that satisfiability for second-order 
languages is not compact. 

Some background on the concept of infinity 

One kind of infinity is that of the natural numbers, 0, 1, 2, ... Any set that can be put into
one-one correspondence with the natural numbers is said to be denumerable. A set is
said to be countable if it is either finite or denumerable. For example, the set of even
positive integers is denumerable too, even though not all natural numbers are even:

0   1   2   3   ...

2   4   6   8   ...

The set of nonnegative rational numbers is denumerable as well. Every nonnegative
rational number greater than 0 appears in the following table:

       1      2      3     ...

1     1/1    1/2    1/3    ...

2     2/1    2/2    2/3    ...

3     3/1    3/2    3/3    ...

...

By zig-zagging through this table, we can put the natural numbers into one-one
correspondence with the nonnegative rational numbers. (Skip duplicates.)

0    1    2    3    4    5    6    ...

0   1/1  2/1  1/2  1/3  3/1  4/1   ...
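
The zig-zag can be made concrete with a short program. What follows is only a sketch: the
function name enumerate_rationals and the choice to list 0 first are my own assumptions,
and the direction of travel along each diagonal is chosen so as to reproduce the order
1/1, 2/1, 1/2, 1/3, 3/1, 4/1 shown above.

```python
from fractions import Fraction
from itertools import count, islice

def enumerate_rationals():
    """Yield 0, then every positive rational exactly once, by zig-zagging
    through the table whose entry in row r, column c is r/c."""
    yield Fraction(0)
    seen = set()
    for total in count(2):              # total = row + column on one diagonal
        rows = range(1, total)
        if total % 2 == 1:              # alternate direction on each diagonal
            rows = reversed(rows)
        for r in rows:
            q = Fraction(r, total - r)
            if q not in seen:           # skip duplicates such as 2/2
                seen.add(q)
                yield q

first_seven = list(islice(enumerate_rationals(), 7))
first_two_hundred = list(islice(enumerate_rationals(), 200))
```

Since every positive rational r/c sits on the diagonal r + c, it is reached after finitely
many steps, which is all that the correspondence requires.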

However, there are infinite sets that cannot be put into one-one correspondence with the 
natural numbers. For example, the set of all subsets of the natural numbers cannot be put 
into one-one correspondence with the natural numbers. 

The set of all subsets of a set is called the power set of that set. In fact, no nonempty set
can be put into one-one correspondence with its own power set. (This is known as
Cantor's Theorem.)

Proof: By reductio. Suppose that f is a one-one function from the members of A onto
its power set. Define the following set:

B = { n | n ∈ A and n ∉ f(n) }

But B is a subset of A. So, by the supposition, there exists a member of A, b, such
that f(b) = B. Question: Does b belong to B? If yes, then no. (If b ∈ B, then b ∉ f(b)
= B.) If no, then yes. (If b ∉ B, then, since b ∈ A, b can be disqualified from
membership in B only because b ∈ f(b). But f(b) = B; so b ∈ B after all.) So yes if
and only if no. Contradiction! So we were mistaken in thinking that there was any
such function f.
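
For a small finite set the diagonal argument can be checked exhaustively. The following is
a sketch under my own assumptions (the names power_set and diagonal_set are illustrative,
and a function f is represented as a Python dict); it tries every function from A = {0, 1, 2}
to its power set and confirms that the set B defined in the proof is never among the values
of f.

```python
from itertools import combinations, product

def power_set(a):
    """All subsets of a, as frozensets."""
    items = sorted(a)
    return [frozenset(c) for k in range(len(items) + 1)
            for c in combinations(items, k)]

def diagonal_set(a, f):
    """B = {n in A : n not in f(n)}, where f maps members of A to subsets of A."""
    return frozenset(n for n in a if n not in f[n])

A = {0, 1, 2}
subsets = power_set(A)
members = sorted(A)

# Try every function f from A to its power set (8 ** 3 = 512 of them).
diagonal_always_missed = True
for images in product(subsets, repeat=len(members)):
    f = dict(zip(members, images))
    if diagonal_set(A, f) in f.values():
        diagonal_always_missed = False
```

A finite check of this kind is no substitute for the proof, which covers infinite sets too,
but it shows the set B doing its work.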

Likewise, there is no 1-1 correspondence between the natural numbers and the
nonnegative real numbers. In fact, there is a 1-1 correspondence between the
nonnegative real numbers between 0 and 1 and the power set of the set of natural
numbers. (Can you prove it? Hint: Write the real numbers in base 2 and think of the 1's
and 0's as saying "yes" and "no" to each natural number. Incidentally, the set of
nonnegative real numbers can be put into 1-1 correspondence with the set of all real
numbers.)

If two sets can be put in 1-1 correspondence, then they are said to have the same 
cardinality. So the point is: There are many infinite cardinalities. 

The set of sentences of ℒ is denumerable. That's what we found out when we devised a
method for listing the sentences of ℒ one after the other (in Lesson 3). In fact, even if
there are denumerably many different languages and denumerably many sentences in
each of them, the set consisting of all sentences in all languages is denumerable.
(Exercise: Think of a zig-zag procedure that would list them all.)

So now you know that there are various kinds of infinity. There's the infinity of the 
natural numbers (denumerable infinity), the infinity of the nonnegative real numbers (the 
continuum), and so on. This fact raises the following question: For any given kind of 
infinity, can we construct a set of sentences such that it is satisfiable only in a domain 
having at least that kind of infinity? 

Well, we can certainly construct a set of sentences - even a very simple, finite set of
sentences - that is satisfiable only in structures whose domains are at least denumerable.
Consider, for instance, the following three sentences:

∀x¬Rxx

∀x∃yRxy

∀x∀y∀z((Rxy ∧ Ryz) → Rxz)



To see that these three sentences are jointly satisfiable only in a domain containing at 
least denumerably many members, think of R as meaning "larger than". 
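
For very small domains we can confirm by brute force that no finite structure satisfies
all three sentences. This is only a sketch under my own assumptions: the function names
are illustrative, a structure is represented by a domain {0, ..., n-1} together with a set
of ordered pairs for R, and the search stops at domains of size 4.

```python
from itertools import product

def satisfies_all(domain, rel):
    """Check the three sentences for the relation rel (a set of pairs) on domain."""
    if any((x, x) in rel for x in domain):                              # irreflexivity
        return False
    if not all(any((x, y) in rel for y in domain) for x in domain):     # every x bears R to some y
        return False
    return all((x, z) in rel                                            # transitivity
               for (x, y) in rel for (y2, z) in rel if y == y2)

def has_model_of_size(n):
    """Search every binary relation on a domain of n objects."""
    domain = range(n)
    pairs = list(product(domain, repeat=2))
    for bits in product([False, True], repeat=len(pairs)):
        rel = {p for p, b in zip(pairs, bits) if b}
        if satisfies_all(domain, rel):
            return True
    return False

finite_results = {n: has_model_of_size(n) for n in range(1, 5)}
```

The search is exponential in n squared, so it is a sanity check rather than a proof. The
general reason is that, starting from any object, the second sentence yields an endless
R-chain, and the first and third sentences together forbid the chain from ever returning
to an earlier member.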

So can we likewise construct a set of sentences that is satisfiable only in domains having
at least as many members as the set of nonnegative real numbers? Surprisingly, the answer
is no. The following theorem proves it:

The (downward) Löwenheim-Skolem Theorem: If a set of sentences of ℒ is first-order
satisfiable, then it is first-order satisfiable in a structure with a countable domain.

Observation 1: If A is first-order satisfiable, then A ⊬ ⊥. For if A is first-order
satisfiable, then A ⊭ ⊥, which, by the Soundness Theorem, implies that A ⊬ ⊥.

Observation 2: Our proof of the Henkin Construction Theorem proves something
stronger: For every truth assignment val that tt-satisfies H, there is a first-order structure
𝔐_val such that:

(1) for all sentences P in ℒ+, val(P) = T if and only if P is true in 𝔐_val, and

(2) 𝔐_val has a countable domain (either finite or denumerable).

(2) is so, because the domain of 𝔐_val is D = {[n] | n is a constant of ℒ+} and there are
denumerably many constants in ℒ+.

Proof of the (downward) Löwenheim-Skolem Theorem:
Suppose A is first-order satisfiable.
By the First Observation, A ⊬ ⊥.

So by the Elimination Theorem (where H is the Henkin theory in ℒ+), A ∪ H ⊬_ℒ+ ⊥.
So A ∪ H ⊬_tt ⊥.

By the Truth-functional Completeness Theorem, A ∪ H ⊭_tt ⊥.

So there is a truth value assignment val such that for all P ∈ A ∪ H, val(P) = T.

So, by Observation 2, there is a first-order structure 𝔐_val with a countable domain
such that for all P of ℒ+, if val(P) = T, then P is true in 𝔐_val.

So 𝔐_val is a first-order structure with a countable domain that first-order satisfies A.

Note 1: What Skolem actually proved, in 1919, was that if a set of sentences is
first-order satisfied in a structure with a nondenumerable domain, then it is first-order
satisfied in a structure that is a restriction of the first to a countable domain. (For a
proof, see Boolos and Jeffrey, chapter 13, or Boolos, Burgess and Jeffrey.)

Note 2: What I have here called the Löwenheim-Skolem theorem is also called the
downward Löwenheim-Skolem theorem to distinguish it from the upward
Löwenheim-Skolem theorem, which says that if a set A of sentences of ℒ is satisfiable in any
structure having an infinite domain, then for any infinite set, A is satisfiable in a structure
whose domain has the same cardinality as that set. We will not prove this, but it will come
up again in our discussion of second-order logic in Lesson 13.

Note 3: Suppose that 𝔐 and 𝔑 are two first-order structures for a language ℒ. We say
that 𝔐 and 𝔑 are isomorphic if and only if there is a 1-1 function map, whose domain (the
set of inputs) is the domain of 𝔐 and whose range (the set of outputs) is the domain of 𝔑,
such that:

(i) for all constants c of ℒ, Σ_𝔐(c) = o if and only if Σ_𝔑(c) = map(o), and

(ii) for all n-ary predicates P of ℒ, ⟨o_1, o_2, ..., o_n⟩ ∈ Σ_𝔐(P) if and only if
⟨map(o_1), map(o_2), ..., map(o_n)⟩ ∈ Σ_𝔑(P).

A simple proof by induction shows that if two first-order structures are isomorphic, then
for any set of sentences T of ℒ either both are models for T or neither is.

Suppose that T is a set of sentences such that if 𝔐 and 𝔑 are any two structures that
first-order satisfy T, then 𝔐 and 𝔑 are isomorphic. This property is called categoricity. A set
of sentences that has it is a categorical theory. Are there any sets of sentences that are
categorical in this sense? If a set of sentences is satisfied only by structures having finite
domains, then the answer is, yes. (There might be a sentence in T that tells us exactly
how many objects there are.) But if a theory is satisfied only by structures having infinite
domains, then the answer is, no. If a theory is satisfiable only by structures having
infinite domains, then it will be satisfiable in nonisomorphic structures with infinite
domains. This is an immediate consequence of the upward Löwenheim-Skolem theorem.
However, in second-order languages we can write categorical sets of sentences.



Lesson 5: Preliminaries Before We Take On Gödel-incompleteness



Function symbols 

We need to add some vocabulary to the language of first-order logic and augment the 
definition of satisfaction to allow for it. Shortly, we will be talking about the language of 
arithmetic, which will include vocabulary like "+" and "×", which are symbols for
functions. 



The syntax of function symbols: 

Recall that the terms of a first-order language L include the individual constants (names) 
and individual variables of the language. Now we will define the set of terms of L as 
follows: 



t is a term of ℒ if and only if either:

(a) t is an individual constant of ℒ, or

(b) t is an individual variable of ℒ, or

(c) f is an n-place function symbol of ℒ, t_1, t_2, ..., t_n are terms of ℒ, and
t = f(t_1, t_2, ..., t_n).

For example, if "+" and "×" are 2-place function symbols of ℒ, then +(x, a) is a term of
ℒ, +(b, c) is a term of ℒ, and ×(+(b, c), +(x, a)) is a term of ℒ.

For convenience, we will write +(v, u) as (v + u) and ×(v, u) as (v × u).

Wffs and sentences are defined as before, except that "terms" now includes terms of the 
new kind. 



The semantics of function symbols: 

Where D is a set of objects (a domain), we say that fun is an n-ary function on D if and
only if fun is a set of (n+1)-tuples of members of D such that for all x and y, if
⟨o_1, o_2, ..., o_n, x⟩ ∈ fun and ⟨o_1, o_2, ..., o_n, y⟩ ∈ fun, then x = y.

For example, in the domain of natural numbers the function addition = {⟨0, 0, 0⟩,
⟨0, 1, 1⟩, ⟨1, 0, 1⟩, ⟨0, 2, 2⟩, ⟨1, 1, 2⟩, ...}.

To accommodate function symbols, we now extend the definition of an assignment, Σ_𝔐,
so that if f is an n-place function symbol of ℒ, then Σ_𝔐(f) is an n-ary function on D.

We then define a term assignment h recursively, as follows:

Where g is a variable assignment in 𝔐, and Σ is an assignment for 𝔐, and t is a term of
ℒ, h(t) = o if and only if either:

(i) t is an individual variable and g(t) = o, or

(ii) t is an individual constant and Σ(t) = o, or

(iii) for some terms t_1, t_2, ..., t_n and some function symbol f, and some n-ary function
fun, t = f(t_1, t_2, ..., t_n), and Σ(f) = fun and o = fun(h(t_1), h(t_2), ..., h(t_n)).

For example, if Σ(2) = the number 2, and g(x) = the number 3, and Σ(+) = addition, then
h(+(x, 2)) = the result of adding 3 and 2, i.e., 5.
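
The recursion in the definition of h can be mirrored directly in a program. The following
is a sketch under assumptions of my own: a term is represented either as a string (a
variable or a constant) or as a tuple whose first member is a function symbol, the
assignments g and Σ are Python dicts, and functions are given as Python callables rather
than as sets of tuples.

```python
def h(term, g, sigma):
    """Term assignment for a variable assignment g and an assignment sigma."""
    if isinstance(term, str):
        if term in g:                   # clause (i): an individual variable
            return g[term]
        return sigma[term]              # clause (ii): an individual constant
    f, *args = term                     # clause (iii): f(t_1, ..., t_n)
    fun = sigma[f]
    return fun(*(h(t, g, sigma) for t in args))

sigma = {"2": 2, "+": lambda a, b: a + b, "×": lambda a, b: a * b}
g = {"x": 3}

simple = h(("+", "x", "2"), g, sigma)                           # +(x, 2)
nested = h(("×", ("+", "x", "2"), ("+", "x", "2")), g, sigma)   # ×(+(x, 2), +(x, 2))
```

Here simple is the example from the text, and nested shows the recursion descending
through a term whose arguments are themselves built with function symbols.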

The rest of the definition of satisfaction by a variable assignment in a structure (from 
Lesson 1) can stand without change. Likewise, the definition of truth and the definition 
of first-order consequence (from Lesson 1) are unchanged. 

Note: Anything that can be said in a language with function symbols can be said in a
language without them. Instead of a two-place function symbol +, we could have a
three-place predicate Add. And then, when we want to say, for example, that the sum of
any two numbers is greater than or equal to each of its addends, instead of saying,

∀x∀y((x + y) ≥ x ∧ (x + y) ≥ y),

we could say,

∀x∀y∀z(Add(x, y, z) → (z ≥ x ∧ z ≥ y)).

Axiom systems 

The style of doing proofs that you have learned is a "Fitch-style natural deduction 
system". (There are other kinds of natural deduction systems, e.g., Gentzen style. See 
John N. Martin's Elements of Formal Semantics for an approach to logic based on that.)

Another style of doing proofs is by means of an axiom system. These are usually easier
to state than natural deduction systems, but much harder to use. But since they are easier
to state, most metatheoretic work is formulated in terms of axiom systems.

Usually, an axiom system is formulated as a number of axiom schemata and a number of
inference rules:

The axiom system PL (for "propositional logic"): 

Three axiom schemata: 
L1: (P → (Q → P))

L2: ((P → (Q → R)) → ((P → Q) → (P → R)))
L3: ((¬P → ¬Q) → (Q → P))

Every wff having the form of L1, L2, or L3 is an axiom. (There are infinitely many of
these.)

One inference rule:

Modus Ponens (MP): Q is an immediate consequence of P and (P → Q).

A proof in PL is a finite sequence of wffs such that each member of the sequence is 
either an instance of L1-L3 (an axiom) or is an immediate consequence of earlier 
members by Modus Ponens. 

The axiom system QL (Tarski 1965, Kalish and Montague 1965) 

Everything in PL, plus: 

Four more axiom schemata: 

L4: (∀v(P → Q) → (∀vP → ∀vQ))

L5: (P → ∀vP), provided v does not occur in P (vacuous quantification)
L6: ¬∀v¬v = t, where t is any term. (In other words, ∃v v = t.)
L7: (v = t → (P → Q)), where P is an atomic formula, and Q results from P by
replacing any one occurrence of v in P with t.

One more inference rule:

Generalization: ∀vP is an immediate consequence of P.

A proof in QL is a finite sequence of wffs such that each member of the sequence is
either an instance of L1-L7 (an axiom) or is an immediate consequence of earlier
members by Modus Ponens or Generalization. If there is a proof having P as its last
formula, we write ⊢ P. In that case we also say that P is a theorem of QL.

Note: In two ways this concept of proof is different from what you are accustomed to. 
First, there are no premises. So the conclusion of every proof is a logical truth. Second, 
not only sentences, but also well-formed formulas containing free variables may belong 
to proofs and may be the thing proved. (We can always derive a sentence from these by 
an application of Generalization.) 

Let P be any wff (not necessarily a sentence) of a first-order language ℒ. (Remember: ℒ
contains the identity sign.) We will say that P is first-order valid (⊨ P) if and only if for
every first-order structure 𝔐 for ℒ, and every variable assignment g in 𝔐, P is satisfied
by g in 𝔐.

Notice how this definition quantifies over every variable assignment rather than referring
to the empty variable assignment. The reason for the change is that we now want to
allow that a formula containing free variables may be first-order valid.

We can prove soundness and completeness theorems pretty much as before (although we
will not bother to actually do that):
⊢ P if and only if ⊨ P.



It is often maddeningly difficult to prove even the simplest theorems. 



Example 1: For all P, Q, ⊢ (¬P → (P → Q))

1. ((¬Q → ¬P) → (P → Q)) (by L3)

2. (((¬Q → ¬P) → (P → Q)) → (¬P → ((¬Q → ¬P) → (P → Q)))) (by L1)

3. (¬P → ((¬Q → ¬P) → (P → Q))) (by MP from 1, 2)

4. ((¬P → ((¬Q → ¬P) → (P → Q))) →
((¬P → (¬Q → ¬P)) → (¬P → (P → Q)))) (by L2)

5. ((¬P → (¬Q → ¬P)) → (¬P → (P → Q))) (by MP from 3, 4)

6. (¬P → (¬Q → ¬P)) (by L1)

7. (¬P → (P → Q)) (by MP from 5, 6)
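
Because the definition of a proof in PL is purely mechanical, a derivation such as
Example 1 can be checked by a program. The following is a sketch under my own
assumptions: formulas are nested tuples built by the hypothetical helpers imp and neg, the
sentence letters "P" and "Q" stand in for arbitrary wffs, and check_proof accepts a
sequence only if each member is an instance of L1-L3 or follows from earlier members by
Modus Ponens.

```python
def imp(a, b): return ("imp", a, b)
def neg(a): return ("not", a)

def is_imp(f): return isinstance(f, tuple) and f[0] == "imp"
def is_neg(f): return isinstance(f, tuple) and f[0] == "not"

def is_L1(f):                       # (P -> (Q -> P))
    return is_imp(f) and is_imp(f[2]) and f[2][2] == f[1]

def is_L2(f):                       # ((P -> (Q -> R)) -> ((P -> Q) -> (P -> R)))
    if not (is_imp(f) and is_imp(f[1]) and is_imp(f[1][2])):
        return False
    p, q, r = f[1][1], f[1][2][1], f[1][2][2]
    return f[2] == imp(imp(p, q), imp(p, r))

def is_L3(f):                       # ((not P -> not Q) -> (Q -> P))
    if not (is_imp(f) and is_imp(f[1]) and is_neg(f[1][1]) and is_neg(f[1][2])):
        return False
    p, q = f[1][1][1], f[1][2][1]
    return f[2] == imp(q, p)

def check_proof(lines):
    """True iff every line is an axiom or follows from earlier lines by MP."""
    for i, f in enumerate(lines):
        by_mp = any(lines[k] == imp(lines[j], f)
                    for j in range(i) for k in range(i))
        if not (is_L1(f) or is_L2(f) or is_L3(f) or by_mp):
            return False
    return True

P, Q = "P", "Q"
s1 = imp(imp(neg(Q), neg(P)), imp(P, Q))                        # L3
s2 = imp(s1, imp(neg(P), s1))                                   # L1
s3 = imp(neg(P), s1)                                            # MP 1, 2
s5 = imp(imp(neg(P), imp(neg(Q), neg(P))), imp(neg(P), imp(P, Q)))
s4 = imp(s3, s5)                                                # L2
s6 = imp(neg(P), imp(neg(Q), neg(P)))                           # L1
s7 = imp(neg(P), imp(P, Q))                                     # MP 5, 6
example_1_ok = check_proof([s1, s2, s3, s4, s5, s6, s7])
```

The checker knows nothing about what the formulas mean. It only compares shapes, which
is exactly why proofs, so defined, can later be coded up as numbers.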



For the remaining examples, I will assume, as a theorem, that if P is a tautology,
then ⊢ P.

Example 2: If v does not occur in P, then ⊢ (∀v(P → Q) → (P → ∀vQ))

1. (∀v(P → Q) → (∀vP → ∀vQ)) (by L4)

2. (P → ∀vP) (by L5)

3. ((P → ∀vP) →
((∀v(P → Q) → (∀vP → ∀vQ)) → (∀v(P → Q) → (P → ∀vQ))))
(by the above theorem)

4. ((∀v(P → Q) → (∀vP → ∀vQ)) → (∀v(P → Q) → (P → ∀vQ)))
(by MP from 2, 3)

5. (∀v(P → Q) → (P → ∀vQ)) (by MP from 1, 4)
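
Line 3 of Example 2 is justified by the assumption that every tautology is a theorem, so
it is worth confirming that line 3 really is a tautology. The following sketch does that
with a truth table. The names implies and is_tautology are my own, and the four
quantified or atomic components of line 3 are treated as unanalysed sentence letters:
a for P, b for ∀vP, c for ∀v(P → Q), and d for ∀vQ.

```python
from itertools import product

def implies(a, b):
    """Truth function for the material conditional."""
    return (not a) or b

def is_tautology(schema, n_letters):
    """True iff schema comes out true on every row of its truth table."""
    return all(schema(*row)
               for row in product([False, True], repeat=n_letters))

# Line 3: ((a -> b) -> ((c -> (b -> d)) -> (c -> (a -> d))))
def line_3(a, b, c, d):
    return implies(implies(a, b),
                   implies(implies(c, implies(b, d)),
                           implies(c, implies(a, d))))

line_3_is_tautology = is_tautology(line_3, 4)
bare_conditional_is_tautology = is_tautology(implies, 2)
```

The same check applies to the steps of Example 3 that are marked "by the theorem".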



Example 3: ⊢ (∀xFx → Fa)

1. (x = a → (Fx → Fa)) (by L7)

2. ((x = a → (Fx → Fa)) → (¬Fa → (Fx → ¬x = a))) (by the theorem)

3. (¬Fa → (Fx → ¬x = a)) (by MP 1, 2)

4. ∀x(¬Fa → (Fx → ¬x = a)) (by Generalization from 3)

5. (∀x(¬Fa → (Fx → ¬x = a)) → (∀x¬Fa → ∀x(Fx → ¬x = a))) (by L4)

6. (∀x¬Fa → ∀x(Fx → ¬x = a)) (by MP 4, 5)

7. (∀x(Fx → ¬x = a) → (∀xFx → ∀x¬x = a)) (by L4)

8. ((∀x¬Fa → ∀x(Fx → ¬x = a)) → ((∀x(Fx → ¬x = a) →
(∀xFx → ∀x¬x = a)) → (∀x¬Fa → (∀xFx → ∀x¬x = a)))) (by the theorem)

9. ((∀x(Fx → ¬x = a) → (∀xFx → ∀x¬x = a)) →
(∀x¬Fa → (∀xFx → ∀x¬x = a))) (by MP 6, 8)

10. (∀x¬Fa → (∀xFx → ∀x¬x = a)) (by MP 7, 9)

11. (¬Fa → ∀x¬Fa) (by L5)

12. ((¬Fa → ∀x¬Fa) → ((∀x¬Fa → (∀xFx → ∀x¬x = a)) →
(¬Fa → (∀xFx → ∀x¬x = a)))) (by the theorem)

13. ((∀x¬Fa → (∀xFx → ∀x¬x = a)) → (¬Fa → (∀xFx → ∀x¬x = a)))
(by MP 11, 12)

14. (¬Fa → (∀xFx → ∀x¬x = a)) (by MP 10, 13)

15. ((¬Fa → (∀xFx → ∀x¬x = a)) → (¬∀x¬x = a → (∀xFx → Fa)))
(by the theorem)

16. (¬∀x¬x = a → (∀xFx → Fa)) (by MP 14, 15)

17. ¬∀x¬x = a (by L6)

18. (∀xFx → Fa) (by MP 16, 17)

The language of arithmetic

For purposes of coding formulas with numbers (Gödel numbering), we will want the
number of basic symbols in our language to be a prime number. So we will confine our
attention to a first-order language containing just the following 13 symbols:

0  '  (  )  f  *  v  ¬  →  ∀  =  ≤  #

Although, officially, the language contains just these symbols, we will use a number
of "abbreviations" to make our meaning clearer:

1 abbreviates 0'.

2 abbreviates 0''.

3 abbreviates 0'''.

x, y, etc., will abbreviate v*, v**, etc., on an ad hoc basis.

(x + y) abbreviates f*(xy), which abbreviates f*(v*v**).
(x • y) abbreviates f**(xy).

0 denotes the number 0 (notice the difference in font), = denotes identity, ≤ denotes the
relation of being less than or equal to, + denotes addition, and • denotes multiplication.

The symbol ' (the "prime") denotes the successor function. So (0' + 0''')'' denotes the
successor of the successor of the sum of the successor of 0 and the successor of the
successor of the successor of 0, namely, (1 + 3) + 2 = 6.

I will return to the meaning of # later.
For example, the sentence

∀v*∀v**∀v***(0' ≤ v** → f**(v*v***) ≤ f**(f*(v*v**)v***))

will be abbreviated as follows:

∀x∀y∀z(1 ≤ y → (x • z) ≤ ((x + y) • z)).

Since we are now using multiple vocabulary items to form a single variable and to form a
single function symbol, our definition of well-formed formula will have to include extra
clauses. We will call the language La (the "language of arithmetic").

Definition of La (the language of arithmetic):

f* and f** are the two-place function symbols of La.

v followed by one or more *'s is a variable of La.

t is a term of La if and only if either

(i) t is 0, or

(ii) v is a variable and t is v, or

(iii) t_0 is a term and t is t_0' (the term that is t_0 followed by a prime), or

(iv) t_0 and t_1 are terms, f is a two-place function symbol and t is f(t_0, t_1).

We will call 0 and the terms consisting of 0 followed by one or more primes numerals.

P is an atomic formula of La if and only if for some terms t_0 and t_1, P is t_0 = t_1, or
P is t_0 ≤ t_1.

P is a wff of La if and only if either

(i) P is an atomic formula, or

(ii) for some wff Q, P is ¬Q, or

(iii) for some wffs Q and R, P is (Q → R), or

(iv) for some wff Q and some variable v, P is ∀vQ.

We will also use ∧, ∨, ↔ and ∃ as abbreviations in the usual way.

Notice that I am using the sans serif Arial font both for the language of arithmetic and for 
metalinguistic variables. For example, when I wrote "t is 0" in the definition of terms, 
"t" was a metalinguistic variable that I use to talk about the language of arithmetic, and 
"0" was part of the object language, the language of arithmetic that I am talking about. 
You will have to determine from the context which is which. 

Also, recently I have been writing "is" rather than "=", because I did not want you to 
confuse the "=" in the metalanguage, which I use to talk about La with the "=", which is 
a predicate of La. In the future, I will not hesitate to write "=". There is a hard-to-see 
difference in font (Arial for the object language, Times New Roman for the 
metalanguage), but you should also be able to tell from the context which identity symbol 
I am using.

The base 13 numbering system

We will count in a base 13 numbering system. That means that we will have a single digit
that means 10, a single digit that means 11, and one that means 12.

10      11         12

η       ε          δ

(eta)   (epsilon)  (delta)

Whereas in a base ten number system "10" denotes the number ten, in a base thirteen
number system "η" denotes the number 10. In base thirteen, "10" denotes the number
thirteen. "11" denotes fourteen (thirteen plus one), "1η" denotes 23 (thirteen plus ten).
"δ0" denotes one hundred and fifty-six (twelve times thirteen). "δ2" denotes one hundred
and fifty-eight (twelve times thirteen plus two).

We will usually be talking about numbers at a high level of abstraction; so it will not be
necessary for us to learn to mentally calculate in base thirteen. It will be good enough
that we understand in principle how to calculate in base thirteen.
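
To see "in principle" how base thirteen works, here is a sketch of conversion in both
directions. The names DIGITS, from_base13 and to_base13 are my own; the only thing taken
from the text is that η, ε and δ are the digits for ten, eleven and twelve.

```python
DIGITS = "0123456789ηεδ"    # position in this string = value of the digit

def from_base13(numeral):
    """The number denoted by a base-thirteen numeral."""
    value = 0
    for ch in numeral:
        value = value * 13 + DIGITS.index(ch)
    return value

def to_base13(n):
    """The base-thirteen numeral that denotes the natural number n."""
    if n == 0:
        return "0"
    out = []
    while n > 0:
        n, d = divmod(n, 13)
        out.append(DIGITS[d])
    return "".join(reversed(out))

examples = {s: from_base13(s) for s in ["10", "11", "1η", "δ0", "δ2"]}
```

The dictionary examples pairs each numeral discussed above with the number it denotes,
written in base ten.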

Gödel numbering

We are going to "code up" the language of arithmetic in natural numbers. An expression
is just any sequence of symbols in the language of arithmetic, whether it forms a
well-formed formula or not. With one qualification, we will assign to each expression of the
language of arithmetic (indeed, to each finite sequence of expressions) a unique natural
number. This assignment is called a Gödel numbering.

To each of the basic vocabulary items in the language of arithmetic, we will assign one of
the first thirteen natural numbers (0 through 12), which we will write in base thirteen.
Here is the assignment:

0   '   (   )   f   *   v   ¬   →   ∀   =   ≤   #

|   |   |   |   |   |   |   |   |   |   |   |   |

1   0   2   3   4   5   6   7   8   9   η   ε   δ

To find the Gödel number of any complex expression, just write the numeral for the
Gödel number of each of the basic vocabulary items in the same order as they occur in
the expression. Thus, the Gödel number of f*(v) (which is meaningless), in base thirteen,
is 45263, i.e., writing now in base ten, (4 × 13^4) + (5 × 13^3) + (2 × 13^2) + (6 × 13) + 3.
The Gödel number of v=v≤ (this is not a typo, just an intentionally meaningless string) in
base thirteen, is 6η6ε, i.e., in base ten, (6 × 13^3) + (10 × 13^2) + (6 × 13) + 11.
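
The coding is easy to mechanize. The following sketch assumes the symbol-to-digit
assignment given in the table above (with η, ε and δ as the digits for ten, eleven and
twelve); the names CODE and godel_number are mine.

```python
# Code of each basic symbol, written here as an ordinary integer.
CODE = {"0": 1, "'": 0, "(": 2, ")": 3, "f": 4, "*": 5, "v": 6,
        "¬": 7, "→": 8, "∀": 9, "=": 10, "≤": 11, "#": 12}

def godel_number(expression):
    """Read the codes of the symbols, in order, as the digits of a base-13 numeral."""
    n = 0
    for symbol in expression:
        n = n * 13 + CODE[symbol]
    return n

first_example = godel_number("f*(v)")    # 45263 in base thirteen
second_example = godel_number("v=v≤")    # 6η6ε in base thirteen
```

Working out the two sums given in the text in base ten yields 125648 and 14961, and
those are the values this function returns.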

Be careful to distinguish between numbers and the numerals that denote them. In base
ten, the numeral that denotes the number ten is "10". In base thirteen, the numeral that
denotes the number ten is "η". In our language for arithmetic, the numeral that denotes
the number ten is "0''''''''''" (that's the numeral for zero followed by ten primes). A
Gödel numbering is an assignment of numbers to expressions, not an assignment of
numerals to expressions. But it's easy to get confused because we care a lot about the
numerals that we use to denote the numbers (so much so that we switch to base 13).

Each of the numerals in the language of arithmetic itself has a Gödel number. The Gödel
number of the numeral "0" is (now I'm writing in base 13) 1. The Gödel number of the
numeral "0'" (notice the prime) is (now writing in base 13) 10, i.e., (now writing in base
10) 13. The Gödel number of the numeral "0''" is (writing in base 13) 100, i.e., (writing
in base 10) 13^2. In general, for any numeral consisting of "0" followed by n primes, the
Gödel number of that numeral can be written, in base 13, with "1" followed by n
occurrences of "0".

As I said, we also want to assign to each sequence of expressions a unique number.
That's where the symbol "#" comes in. Instead of representing sequences of formulas in
the usual way, with commas and corner brackets, thus:

⟨(Fa → Gb), Fa, Gb⟩

we will represent sequences by writing the expressions in order separated by "#"s, thus:

#(Fa → Gb)#Fa#Gb#

So now, since "#" has a Gödel number, namely, δ, we get a Gödel number for each finite
sequence of expressions as well. We will call these expressions too. (So, looking ahead,
each proof will have a unique Gödel number, since proofs can be defined as a kind of
sequence of formulas.)

Above I said that "with one qualification" we would map every expression into a number.
The exception is any expression of more than one symbol that begins with a prime: '.

The code of the prime is 0. But we cannot say that the code of '' is 00, because that's just
0. Likewise we cannot say that the code of ''¬ is 007, because that's just 7, the code of
¬. So we will not assign codes to such expressions. That's not a problem since no
well-formed formula begins with a prime.

Notice that because each symbol corresponds to a single digit, every number is the Gödel
number of some expression. Leaving aside the expressions of more than one symbol that
begin with a prime, the mapping of expressions to natural numbers is one-one.
(Systems of Gödel-numbering don't always have that property.)



Lesson 6: Sets of Numbers 



Arithmetic Sets 

Throughout we will assume that we are dealing with a structure in which each constant of
the language of arithmetic receives its intended interpretation. So the number zero is
assigned to 0, the successor function is assigned to ', the addition function is assigned to
+, the multiplication function is assigned to •, and the relation of being less than or equal
to is assigned to ≤. The domain of the structure consists of the natural numbers, 0, 1, 2,
etc. As usual, variable assignments will assign members of the domain, viz., natural
numbers, to variables, and we may speak of a variable assignment as satisfying or not
satisfying a formula. In speaking of satisfaction, we will not bother to mention the
structure, because we assume that we are always dealing with the intended interpretation
that I have just described.

We can use the language of arithmetic and the concept of satisfaction by a variable
assignment to define sets of numbers. For example, we can say that the formula 3 ≤ x
(recall that this is an abbreviation of 0''' ≤ v*) defines a set of numbers, namely, those
numbers n such that g[x/n] satisfies 3 ≤ x. That would be, of course, the set of natural
numbers greater than or equal to 3.

But since each natural number has a name in the language of arithmetic, we can get the
same effect without bringing in satisfaction and variable assignments. We will use the
following convention. If n is a natural number, then n̄ is the numeral that denotes n in
La, the language of arithmetic. (I could let the change in font mark the difference, but
that would not work very well when we write on the blackboard.) For example, if n is 4,
then n̄ is 0'''' (and the Gödel number of n̄ is, in base 13, 10000). (I don't mean that if n
is 4, then n̄ and 0'''' denote the same number; I mean that the symbol n̄ is the following
symbol: 0''''.)

So we can use the formula 3 ≤ x to define a set of numbers by using it to define the set A
as follows: For all natural numbers n,

3 ≤ n̄ is true if and only if n ∈ A.

According to this definition, of course, A is the set of natural numbers greater than or
equal to 3. For another example, we can define the set B as follows:

∃y(1 ≤ y ∧ n̄ = (5 • y)) is true if and only if n ∈ B.

This means that B is the set of numbers that are multiples of 5 (5, 10, 15, etc.).

Let F(v) be any formula of La in which v is the sole free variable. F(v) is said to express
a set of natural numbers A if and only if for all natural numbers n:

F(n̄) is true if and only if n ∈ A.

A set of m-tuples is called an m-ary relation. Let F(v_1, ..., v_m) be a formula of La in
which v_1, ..., v_m are the sole free variables, where these are the first m variables in a
given enumeration of the variables of La, and in which all of them do occur free. Then
F(v_1, ..., v_m) expresses the relation R of m-tuples if and only if for all natural numbers
n_1, ..., n_m:

F(n̄_1, ..., n̄_m) is true if and only if ⟨n_1, ..., n_m⟩ ∈ R.

The reason why we require the formula that expresses a set to contain free all of the first
m variables in an enumeration of the variables is that in that way we can know which
"place" in an m-tuple corresponds to a given variable.

A set or relation is called arithmetic (pronounced with accent on third syllable) if and
only if it can be expressed by a formula of La.

Σ_0-formulas and sets

Within the class of arithmetic sets we can distinguish some important subsets. Toward
defining a special class of arithmetic sets, we first define the concept of a Σ_0-formula.
(This use of "Σ" has nothing to do with my use of it in defining first-order structures.)

An atomic Σ_0-formula is an atomic formula of La having one of the following four
forms: (c_1 + c_2) = c_3, (c_1 • c_2) = c_3, c_1 = c_2, or c_1 ≤ c_2, where each of c_1, c_2, and c_3 is
either a variable or a numeral.

Examples: (1 + 2) = 3, (3 + 2) = 1, (1 + x) = 3, 5 = x, 5 ≤ x.

We now define the set of Σ_0-formulas by means of the following four statements:

1. Every atomic Σ_0-formula is a Σ_0-formula.

2. If P and Q are Σ_0-formulas, then ¬P and (P → Q) are Σ_0-formulas.

3. If P is a Σ_0-formula, v is a variable, and c is a numeral or a variable distinct from
v, then ∀v(v ≤ c → P) is a Σ_0-formula.

4. Nothing else is a Σ_0-formula.

Another notational convention: We will write formulas of the form ∀v(v ≤ c → P) this
way: (∀v ≤ c)P. For example, ∀x(x ≤ 10 → (10 • x) ≤ 1000) will be written:
(∀x ≤ 10)((10 • x) ≤ 1000). (∃v ≤ c)P abbreviates ¬(∀v ≤ c)¬P, which is equivalent to
¬∀v(v ≤ c → ¬P).

The expressions (∀v ≤ c) and (∃v ≤ c) are called bounded quantifiers. So we can say
that Σ_0-formulas are formulas of La in which all quantification is bounded. (So we say
that quantifiers are bounded (by numbers); whereas we say that variables are bound (by
quantifiers).)

Notice that atomic Σ₀-formulas are defined in terms of variables and numerals, not other 
kinds of terms. This means that the following is not a Σ₀-formula: (x + (y + z)) = w. 
We can express that same (four-place) relation with a Σ₀-formula, but we have to use 
some devious means: (∃u ≤ w)((y + z) = u ∧ (x + u) = w). (The addition of the bound 
on the quantifier does not change the extension, because we can be sure that none of the 
addends is greater than the sum.) 

What is special about Σ₀-sentences (i.e., Σ₀-formulas with no free variables) is that we can 
always determine whether they are true just by calculation (adding and multiplying), that 
is, by operations for which there is a definite mechanical procedure. Since all quantification is 
bounded, we never have to do more than finitely many such calculations in order to 
determine whether a Σ₀-sentence is true. (Note, though, that, however little time each 
step takes, some finite calculations will take longer than the age of the earth to complete.) 

Suppose that A is a set of natural numbers or a relation on the natural numbers 
expressible by some Σ₀-formula. Then A is a Σ₀ set or relation. 

Let A be a Σ₀ set. Then A is decidable in the following sense: There is an algorithm 
that, given any natural number n, definitely tells us after finitely many steps either that n 
does belong to A or that n does not belong to A. Proof: Let F(v) be the Σ₀-formula with one 
free variable, v, that expresses A. To decide whether n belongs to A, all we have to do is 
calculate whether the sentence F(n) is true. If so, then n belongs; otherwise not. 
Similarly, if F(v₁, ..., vₘ) is a Σ₀-formula in which exactly v₁, ..., vₘ are free, the Σ₀ 
relation that it expresses is decidable (in the sense that we can decide whether any given 
m-tuple is a member). 
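
To make the point about bounded quantifiers concrete, here is a minimal Python sketch. The function names are hypothetical and chosen only for illustration. It assumes that a bounded quantifier can be evaluated by checking each of the finitely many candidates up to the bound, which is what makes the calculation terminate.

```python
def bounded_exists(bound, pred):
    """(∃v ≤ bound)P: try each of the finitely many candidates 0..bound."""
    return any(pred(v) for v in range(bound + 1))


def bounded_forall(bound, pred):
    """(∀v ≤ bound)P: check each of the finitely many candidates 0..bound."""
    return all(pred(v) for v in range(bound + 1))


def sum_of_three(x, y, z, w):
    """(∃u ≤ w)((y + z) = u ∧ (x + u) = w), the Σ₀ rendering of (x + (y + z)) = w."""
    return bounded_exists(w, lambda u: y + z == u and x + u == w)


def is_even(n):
    """(∃u ≤ n)((u + u) = n): an illustrative Σ₀ set, not taken from the notes."""
    return bounded_exists(n, lambda u: u + u == n)
```

Each call inspects at most bound + 1 candidates, so deciding membership always finishes, although the number of steps can grow quickly when quantifiers are nested.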






Σ₁-formulas and sets 

Next, we define the Σ₁ formulas, sets and relations. A Σ₁-formula is a formula of the 
form ∃vₘ₊₁F(v₁, ..., vₘ, vₘ₊₁), where F(v₁, ..., vₘ, vₘ₊₁) is a Σ₀-formula with m+1 free 
variables. So a Σ₁-formula begins with one unbounded existential quantifier, and all of 
the rest of the quantifiers in it are bounded quantifiers. A set or relation is a Σ₁ set or 
relation if and only if it is expressible by a Σ₁-formula. 

We can define the sets and relations that are recursively enumerable (r.e.) to be the Σ₁ 
sets and relations. Here is why that is a reasonable definition. (For the moment, I 
confine my attention to sets, excluding relations.) Suppose A is a Σ₁ set. Then there is a 
Σ₀-formula F(v, w) such that ∃wF(v, w) expresses A. Now consider the following table: 



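[Table: the columns are headed by the values of v (0, 1, 2, 3, 4, ...) and the rows by the 
values of w; a check mark in a cell indicates that the pair (v, w) satisfies F(v, w).] 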



Let us suppose that the check marks indicate the pairs of numbers that satisfy the formula 
F(v, w). Since F(v, w) is Σ₀, we can simply calculate whether any given pair of numbers 
satisfies F(v, w) (i.e., for any pair of numbers n and m, we can calculate whether F(n, m) 
is true). So by zig-zagging through this table, we can produce a list of all the numbers 
that satisfy ∃wF(v, w) (i.e., of all of the numbers n such that ∃wF(n, w) is true). (From 
the table, we can tell that 0, 1, 2 and 4 do, but we can't tell yet about 3.) 
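
The zig-zag procedure can be sketched in Python as follows. This is only an illustration under stated assumptions: the name enumerate_sigma1 is hypothetical, the table is traversed along the diagonals v + w = 0, 1, 2, ..., and the sample relation F (which makes ∃wF(v, w) say that v is a perfect square) is not from the notes.

```python
from itertools import count, islice


def enumerate_sigma1(F):
    """Yield every v such that F(v, w) holds for some w, by zig-zagging."""
    seen = set()
    for total in count(0):              # diagonals v + w = 0, 1, 2, ...
        for v in range(total + 1):
            w = total - v
            if v not in seen and F(v, w):
                seen.add(v)             # list each member only once
                yield v


def F(v, w):
    """Illustrative decidable relation: (w • w) = v."""
    return w * w == v


first_five = list(islice(enumerate_sigma1(F), 5))
```

Every pair (v, w) is reached after finitely many steps, so every member of the set eventually appears; nothing in the procedure ever announces that a given number will not appear.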

Similarly, for Σ₁ relations. For example, let R be a set of ordered pairs; so R is a two- 
place relation. Let ∃wF(u, v, w) be the Σ₁-formula that expresses it (so that F(u, v, w) is 
a Σ₀-formula). Consider the following table: 

[Table: the columns are headed by the pairs ⟨u, v⟩ (⟨0, 0⟩, ⟨1, 0⟩, ⟨0, 1⟩, ⟨2, 0⟩, ...) and the 
rows by the values of w; a check mark in a cell indicates that the triple (u, v, w) satisfies 
F(u, v, w).] 



Across the top we have a list of all pairs of natural numbers (which we could produce by 
a separate zig-zag operation). The check marks indicate triples that satisfy the Σ₀-formula 
F(u, v, w). By zig-zagging across this table we could produce a list of all pairs of 
numbers that satisfy the formula ∃wF(u, v, w). (⟨0, 0⟩, ⟨1, 0⟩, and ⟨2, 0⟩ do; we can't tell 
yet about ⟨0, 1⟩.) 



So if a set or relation is recursively enumerable in the sense defined, then we have a 
mechanical method, i.e., algorithm, for producing a potentially infinite list such that 
every member of the set is certain to eventually show up on the list. Notice, though, that 
a set might be recursively enumerable in this sense and still not be decidable. That is, we 
may not have a mechanical method which, given any number (or n-tuple of numbers), 
tells us whether or not it is a member of the set. Suppose that A is recursively 
enumerable and n is not in A. Well, we can start using our method of generating a list of 
the members of A. But there may never come a point at which we can be sure that we 
have gone on long enough and can conclude that n is not going to show up in the list. In 
terms of the table, there might be a column containing no check at all; but nothing tells us 
that if we continue zig-zagging through the table we will not eventually come upon a 
check in that column. 

Where A is a set (of natural numbers), the complement of A, written Ā, is the set of 
natural numbers that do not belong to A. If R is a relation, i.e., a set of n-tuples, then the 
complement of R, written R̄, is the set of n-tuples that are not members of R. 

The sets that are recursive (also called recursively decidable) can be defined as those sets 
or relations R such that both R and R̄ are recursively enumerable. Here is why that is a 
reasonable definition: Suppose both R and R̄ are recursively enumerable. In that case, 
we do have a method for deciding whether any given object n is a member of R: 
Alternate between listing members of R and listing members of R̄. Eventually n will 
show up on one list or the other. If it shows up on the enumeration of R, then it is a 
member of R. If it shows up on the enumeration of R̄, then it does not belong to R. 
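
The alternation argument can be sketched as a short Python function. The names decide, evens and odds are hypothetical, and the sketch assumes that both enumerations are infinite generators (as they are for the illustrative set of even numbers and its complement).

```python
from itertools import count


def decide(n, enum_R, enum_co_R):
    """Return True if n is in R and False if n is in the complement of R.

    Assumes enum_R and enum_co_R produce endless enumerations whose union is
    all the natural numbers, so that n shows up on one list or the other.
    """
    members, non_members = enum_R(), enum_co_R()
    while True:
        if next(members) == n:          # one step of listing R
            return True
        if next(non_members) == n:      # one step of listing the complement
            return False


def evens():
    return (2 * k for k in count())


def odds():
    return (2 * k + 1 for k in count())
```

For example, decide(7, evens, odds) alternates through 0, 1, 2, 3, ... and stops as soon as 7 turns up on the second list.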

You might find it salutary to contemplate the following fact. Suppose that S(x, y) is a 
Σ₀-formula. Then the relation it expresses is recursive. But the set of numbers expressed 
by ∃yS(x, y) may be only recursively enumerable. 

I said above that Σ₀ sets are "decidable". (I will say more about that notion of 
decidability in the next section.) Now I can say that Σ₀ sets are recursive (recursively 
decidable) in the sense just defined. Suppose a set A is Σ₀. Then there is a Σ₀-formula 
F(v) that expresses it. But the complement of A, Ā, is expressed by ¬F(v), which will be 
Σ₀ too. But whatever a Σ₀-formula expresses is also expressed by a Σ₁-formula. (Where F 
is any Σ₀-formula and u a variable that does not occur free in F, ∃uF is a Σ₁-formula.) So 
both A and Ā are expressible by Σ₁-formulas. So A is recursive. 

While we are defining concepts of recursiveness, I should add (since we need to know 
this later): A function fun is a recursive function if and only if the relation fun(x₁, x₂, ..., 
xₙ) = xₙ₊₁ is a recursive relation. 

Church's thesis 

The concepts of recursive enumerability and recursiveness (recursive decidability), 
which I have just defined in a precise way, are the formal, i.e., precise, counterparts to the 
informal concepts of effective enumerability and decidability. (There is quite a lot of 
variability in terminology in the literature; so watch out.) 

To say that a set is effectively enumerable, in the informal sense, is to say that there is 
some kind of step-by-step, mechanical, brainless, stupid procedure for generating a list of 
the members such that every member will eventually show up on the list. This is 
informal because I have not given a definition of "stupid". To say that a set is decidable 
is to say that there is some kind of step-by-step, mechanical, brainless, stupid procedure 
for deciding whether or not any given object is a member of the set. 

What is known as Church's thesis (after the logician Alonzo Church) equates the 
informal with the formal notion: 

Church's thesis (two parts): 

(1) A set of natural numbers or relation on the natural numbers is effectively enumerable 
if and only if it is expressible by a Σ₁-formula (and hence recursively enumerable in 
the sense I defined). 

(2) A set of natural numbers or relation on the natural numbers is decidable if and only 
if both it and its complement are expressible by Σ₁-formulas (and hence recursive in 
the sense I defined). 

The definition of recursive enumerability in terms of Σ₁ sets is just one of several 
possible formally precise definitions. The reason that Church's thesis seems reasonable 
to people is that all of the known alternative precise definitions are demonstrably 
equivalent to one another. Another very important one uses the concept of a Turing 
machine. I will not say anything more about that in this course, but anyone who wants to 
claim an understanding of contemporary mathematical logic needs to know about Turing 
machines. So I highly recommend that you read the first three or four chapters of Boolos 
and Jeffrey on this subject. 

I have formulated Church's thesis as only a statement about sets of and relations on the 
natural numbers. It can be extended to other sorts of objects in so far as we have an 
algorithm for pairing numbers with those other sorts of objects. So, for example, it is 
evident that given any expression in the language of arithmetic, we can mechanically find 
its Gödel number and that given any number we can mechanically determine which 
expression it is the Gödel number of. 

So we could extend Church's thesis to include the following: A set of expressions is 
effectively enumerable if and only if the set of Gödel numbers of the expressions in the 
set is expressible by a Σ₁-formula (recursively enumerable). And a set of expressions is 
decidable if and only if both the set of Gödel numbers of the expressions in the set and 
the set of Gödel numbers of expressions in the complement of the set are expressible by 
Σ₁-formulas (i.e., the set of Gödel numbers of expressions in the set is recursive). 

Assuming Church's thesis, then, we will pretty much equate the decidability of a set of 
expressions with the recursiveness of a set of numbers. For example, eventually we will 
see that first-order logic is undecidable. That is, we will claim that there is no algorithm 
by which we can decide whether or not a given formula of first-order logic is valid. But 
what we will actually prove is that the set consisting of the Gödel numbers of valid 
sentences of first-order logic is not recursive (i.e., it is not the case that both that set and 
its complement are expressible by Σ₁-formulas). 

Σ-formulas and sets 

One more concept that we will need is that of a Σ-formula and a Σ set or relation (no 
subscript). We define the Σ-formulas by means of the following six statements: 

1. Every Σ₀-formula is a Σ-formula. 

2. If P is a Σ-formula and v a variable, then ∃vP is a Σ-formula. 

3. If P is a Σ-formula and n is a numeral and v and w are distinct variables, then all 
of the following are Σ-formulas: (∀v ≤ n)P, (∃v ≤ n)P, (∀v ≤ w)P, (∃v ≤ w)P. 

4. If P and Q are Σ-formulas, then (P ∨ Q) and (P ∧ Q) are Σ-formulas. 
(Remember that we still use ∨ and ∧ as abbreviations.) 

5. If P is a Σ₀-formula (yes, the subscript is correct) and Q is a Σ-formula, then 
(P → Q) is a Σ-formula. 

6. Nothing else is a Σ-formula. 

A Σ set or relation is a set or relation expressible by a Σ-formula. 

The significance of this concept to us is as follows: Because of the connection to the 
concept of recursive enumerability, we will often want to know that a certain set is Σ₁. 
But what we will directly show is only that it is Σ. That will suffice to enable us to 
conclude that it is Σ₁ because of the following fact: A set or relation is Σ if and only if it 
is Σ₁. For the proof of this fact (the Σ sets and relations are the Σ₁ sets and relations), see 
Smullyan, pp. 50-53. It's not difficult, just complicated. 

For purposes of understanding the proof in Smullyan, you need the following further 
definition: Let v₁, v₂, v₃, ... be abbreviations for v*, v**, v***, ..., respectively. A 
formula P is said to be regular if and only if for some n ≥ 1, the free variables in P are all 
of v₁, v₂, ..., vₙ. It might have been helpful for Smullyan to point out that if P is a Σ- 
formula that is not regular, then, where n is the largest number such that vₙ occurs free in 
P, we can produce a regular Σ-formula by writing: (v₁ = v₁ → (v₂ = v₂ → ... → 
(vₙ₋₁ = vₙ₋₁ → P)...)). Bear that in mind when reading the last paragraph of Smullyan's 
proof. 

Although I didn't emphasize the fact when I introduced the concept, I have been 
assuming that the formulas that express sets are all regular in this sense. Thus, in the case 
of a formula containing v* and v** free, we know that the formula expresses a set of pairs 
(n, m) such that the formula is true when we put n in place of v* and m in place of v**. 
But a formula containing just v* and v*** free, and not v**, does not "express" anything. 

Another thing you need to know in order to read Smullyan is that where I say (x₁, x₂, ..., 
xₙ) ∈ R, Smullyan writes R(x₁, x₂, ..., xₙ). 

Other sets of numbers 

The Σ₁ sets are one kind of arithmetic set. Other kinds of arithmetic sets can be defined 
according to the complexity of the formulas of arithmetic that express them. 

If a set of numbers is recursive, then it is very "orderly". There is an algorithm for 
deciding whether or not any number belongs to it. If a set is recursively enumerable, 
though not recursive, then it is still somewhat orderly. There may be no algorithm for 
deciding whether or not a given member belongs, but at least there is an algorithm for 
generating a list of all of the members (such that every member eventually shows up on 
the list). 

Suppose, though, that A is not recursively enumerable but that S(x, y, z) is Σ₀ and the 
formula ∀y∃zS(x, y, z) expresses A. Then A still has a kind of orderliness one step short 
of recursive enumerability. For each number n the set expressed by ∃zS(x, n, z) is 
recursively enumerable. Such a set is called a Π₂ set. Suppose that B is not Π₂, but 
T(x, y, z, w) is Σ₀ and the formula ∃y∀z∃wT(x, y, z, w) expresses B. Such a set is called 
Σ₃. We can define a whole hierarchy of sets according to the kinds of formulas of 
arithmetic that express them. In the table below, assume that each of the formulas S, S′, 
S″, etc., and R, R′, R″, etc., is Σ₀. 





          Σᵢ                          Πᵢ 

i = 0     S(x)                        R(x) 

i = 1     ∃yS′(x, y)                  ∀yR′(x, y) 

i = 2     ∃y∀zS″(x, y, z)             ∀y∃zR″(x, y, z) 

i = 3     ∃y∀z∃wS‴(x, y, z, w)        ∀y∃z∀wR‴(x, y, z, w) 









The technical term for what I have here called "orderliness" is "complexity". For each i 
and j, if j > i, then Σⱼ (Πⱼ) sets are said to be "more complex" than the Σᵢ (Πᵢ) sets. A 
whole branch of mathematical logic is devoted to the study of complexity. It is called 
recursion theory. 

It is not obvious, but as a matter of fact, every arithmetic set can be expressed by a 
formula that belongs somewhere in this table. So every arithmetic set is one of these 
kinds. In the next lesson, we will see that the set of (codes of) truths of arithmetic is not 
arithmetic at all. In other words, the set of (codes of) truths of arithmetic is not of any of 
these kinds. It is as "disorderly" as could possibly be. 



Lesson 7: Diagonalization 



From now on, instead of speaking of the "Gödel number" of an expression in the 
language of arithmetic, I will usually speak of the "codes" of expressions in the language 
of arithmetic. (When stating major results, I will lapse back to "Gödel number".) 

Consider the set consisting of all true formulas that can be written in the language of 
arithmetic, La. Call that set W. W is the set of all "truths of arithmetic". To each 
member of W there is a corresponding code (Gödel number). 

In this lesson, we will prove that the set of codes of formulas in W is not arithmetic. In 
other words, there is no formula in the language of arithmetic that expresses it. This 
result is a significant fact in its own right. If the formulas in W were effectively 
enumerable, then, by Church's thesis, the set of codes of formulas in W would be 
recursively enumerable, which would mean that it was Σ₁. But the set of codes of 
formulas in W is not expressible by a Σ₁-formula. So W is not effectively enumerable, 
let alone decidable. There is no algorithm by which one can decide whether a sentence in 
the language of arithmetic is true, and there is no algorithm by which one can list, one 
after the other, all truths of arithmetic. 

We can go further. The set of codes of truths of arithmetic is as disorderly as a set of 
positive integers can possibly get. The set of codes of members of W is not expressible 
by any formula of the language of arithmetic at all! As I explained at the end of the last 
section, arithmetic sets can be placed in a hierarchy of orderliness or complexity. What 
we will show is that the set of codes of truths of arithmetic does not belong to any of 
these kinds. It is not arithmetic. This is called Tarski's undefinability theorem. 

Once we have reached that conclusion, you might already be able to sniff Gödel's first 
incompleteness theorem just around the corner: The set of codes of formulas that are 
provable in some theory of arithmetic, we will find, is not so disorderly. In fact, it is Σ₁ 
(recursively enumerable, very orderly). So there must be some discrepancy between the 
set of codes of provable formulas of arithmetic and the set of codes of truths of 
arithmetic. 

Concatenation to base 13 is Arithmetic. 

Suppose we take a string of symbols, such as v0 (in this case a nonsense string), and 
append to it another string of symbols, such as f≤¬. The result is a longer string of 
symbols: v0f≤¬. This operation is called concatenation. One might expect that there is a 
definite mathematical relation between the codes for two expressions and the code for the 
concatenation of those expressions. Indeed there is, and it is arithmetic. Indeed, it is Σ₀. 
We want to prove that fact by actually expressing that relation with a Σ₀-formula. 

The code of v0 = 61 (in base 13, remember) = (6 × 13) + 1 (in base 10). 

The code of f≤¬ = 4ε7 (in base 13) = (4 × 13²) + (11 × 13) + 7 (in base 10 scientific 
notation). 

The code of v0f≤¬ = 614ε7 (in base 13) 

= (6 × 13⁴) + (1 × 13³) + (4 × 13²) + (11 × 13) + 7 (in base 10) 

= (((6 × 13) + 1) × 13³) + (4 × 13²) + (11 × 13) + 7 (in base 10) 

= (61 × 10³) + 4ε7 (in base 13 scientific notation). 

(Remember that "10" in base 13 denotes what "13" in base 10 denotes.) 

Similarly, the code of v0″ = 6100 (in base 13) = (6 × 13³) + (1 × 13²) + (0 × 13) + 0 (in 
base 10) = (6 × 13³) + 13². 

The code of = is η (in base 13) = 10 (in base 10). 

The code of v0″= is 6100η (in base 13) 

= (6 × 13⁴) + (1 × 13³) + (0 × 13²) + (0 × 13) + 10 (in base 10) 

= (((6 × 13³) + 13²) × 13) + 10 (in base 10) 

= (6100 × 10) + η (in base 13). 

Examining the above two examples, we detect a pattern: 

Suppose that n is written as m digits in base 13. In other words, the number of digits in 
the base 13 numeral denoting n is m. For example, 4ε7 is written as 3 digits in base 13. 
We say that m is the length of n. So the length of 4ε7 is 3. In general, let l(n) be the 
length of n written in base 13, that is, the number of digits in the base 13 numeral denoting n. 

In general, we can see that if m is the code of an expression e₁ and n is the code of 
another expression e₂, then the code for the result of concatenating e₁ with e₂ (e₁ written 
first) is (m × 10^l(n)) + n (writing in base 13). We call this relation concatenation to base 
13. (Notice that while concatenation is a relation between expressions, concatenation to 
base 13 is a relation between numbers.) 

Let us abbreviate (m × 10^l(n)) + n thus: m * n. 

(Don't confuse this star with the vocabulary item of La.) 
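
The two worked examples can be checked with a few lines of Python. This is a sketch only: the names length13 and concat13 are hypothetical, and it relies on Python's built-in int(text, 13), which reads base 13 numerals using a, b and c for the digits ten, eleven and twelve, so that "a" stands in for η and "b" for ε.

```python
def length13(n):
    """l(n): the number of digits in the base 13 numeral for n (l(0) is 1)."""
    digits = 1
    while n >= 13:
        n //= 13
        digits += 1
    return digits


def concat13(m, n):
    """m * n, i.e. (m × 13^l(n)) + n, with 13 written in base 10."""
    return m * 13 ** length13(n) + n


code_v0 = int("61", 13)         # code of v0
code_tail = int("4b7", 13)      # code of the second string, 4ε7
code_whole = concat13(code_v0, code_tail)
```

Here code_whole comes out equal to int("614b7", 13), matching the calculation above.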



Proposition 1: The relation expressed by x * y = z is arithmetic; indeed it is Σ₀. 

Proof: 

Let x < y abbreviate x ≤ y ∧ x ≠ y (which is already an abbreviation). Let us use base 
13 to abbreviate numerals of La. For example, η abbreviates 0′′′′′′′′′′ (that's ten 
primes), and 10 abbreviates 0′′′′′′′′′′′′′ (that's thirteen primes). 

Notice that for any positive number n, l(n), the length of the base 13 numeral 
denoting n, is simply the smallest number k such that (writing in base 13) 10^k is 
greater than n. For example, the length of "4ε7" is 3, and (writing in base 13) 10³ is 
the smallest power of 10 greater than 4ε7. So 10^l(y) = x if and only if x is the smallest 
power of 10 greater than y and 1. 

1. Consider the relation: x divides y (i.e., y divided by x is a whole number). That 
relation is expressed by: (∃z ≤ y)((x • z) = y). Abbreviate that as x div y. 

2. Consider the set of numbers x such that for some y, x = 13^y (writing in base 10, the 
set of powers of 13). That set is expressed by the following formula. (This only 
works because 13 is a prime number.) 

(∀z ≤ x)((z div x ∧ z ≠ 1) → 10 div z) 
Abbreviate that formula as Pow(x). 

3. Consider the relation whose members are pairs (x, y) such that x is the smallest 
power of 13 greater than y and 1 (writing in base 10). That relation is arithmetic; 
it is expressed by: 

(Pow(x) ∧ y < x ∧ 1 < x) ∧ (∀z < x)((Pow(z) ∧ 1 < z) → z ≤ y) 
Abbreviate that formula as Small(x, y). 

4. Consider the relation whose members are pairs (x, y) such that 10^l(y) = x (writing 
in base 13). By what I pointed out above, that relation is arithmetic; it is 
expressed by: 

(y = 0 ∧ x = 10) ∨ (y ≠ 0 ∧ Small(x, y)) 
Abbreviate this as 10^l(y) = x. 

5. Finally, the set of triples (x, y, z) such that (x × 10^l(y)) + y = z, i.e., x * y = z, is 
arithmetic; it is expressed by: 

(∃u ≤ z)(∃v ≤ z)(10^l(y) = u ∧ (x • u) = v ∧ (v + y) = z). 
Abbreviate this as Concat₂(x, y, z). 



End of proof 
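
As a sanity check on steps 1 to 5, each formula can be transcribed into Python with every bounded quantifier turned into a finite search. This is a sketch under assumptions: the function names are hypothetical, 13 is written in base 10, and the brute-force searches are meant to mirror the formulas rather than to be efficient, so it is only practical for small numbers.

```python
def div(x, y):
    """x div y: (∃z ≤ y)((x • z) = y)."""
    return any(x * z == y for z in range(y + 1))


def pow13(x):
    """Pow(x): (∀z ≤ x)((z div x ∧ z ≠ 1) → 13 div z)."""
    return all(not (div(z, x) and z != 1) or div(13, z) for z in range(x + 1))


def small(x, y):
    """Small(x, y): x is the smallest power of 13 greater than y and 1."""
    return (pow13(x) and y < x and 1 < x
            and all(not (pow13(z) and 1 < z) or z <= y for z in range(x)))


def ten_to_len(x, y):
    """13^l(y) = x: (y = 0 ∧ x = 13) ∨ (y ≠ 0 ∧ Small(x, y))."""
    return (y == 0 and x == 13) or (y != 0 and small(x, y))


def concat2(x, y, z):
    """Concat₂(x, y, z): (∃u ≤ z)(∃v ≤ z)(13^l(y) = u ∧ (x • u) = v ∧ (v + y) = z)."""
    return any(ten_to_len(u, y)
               and any(x * u == v and v + y == z for v in range(z + 1))
               for u in range(z + 1))
```

For instance, concat2(6, 1, 79) holds, since 6 and 1 are the codes of v and 0 and 79 is 61 read in base 13.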

Let (x * y * z) abbreviate ((x * y) * z). 

Corollary: For each n ≥ 2, the relation (x₁ * x₂ * ... * xₙ) = y is arithmetic, indeed Σ₀. 

Proof: By induction. 

Basis: We have proved this for n = 2. 

Let Concat₂(x₁, x₂, y) be an abbreviation of the formula that expresses (x₁ * x₂) = y. 
Induction hypothesis: The thesis holds for n = m. 

Induction step: Show that the thesis holds for n = m + 1. Let Concatₘ(x₁, ..., xₘ, y) 
be an abbreviation of the formula that expresses (x₁ * x₂ * ... * xₘ) = y. Show that the 
following formula expresses (x₁ * x₂ * ... * xₘ * xₘ₊₁) = y. 
The requisite formula is as follows: 

(∃z ≤ y)(Concatₘ(x₁, ..., xₘ, z) ∧ Concat₂(z, xₘ₊₁, y)) 

Note: We have proved not only that (x₁ * x₂ * ... * xₙ) = y is arithmetic, but also that it is 
Σ₀ (recursive), because the only quantifiers used were bounded quantifiers (but the bound 
may be a variable). 
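
The left-associated n-place concatenation can be sketched in Python by folding the two-place operation over the arguments, which parallels the induction step. The names are hypothetical, and int(text, 13) is again used to read base 13 numerals, with "b" standing in for ε.

```python
from functools import reduce


def length13(n):
    """l(n): the number of digits in the base 13 numeral for n (l(0) is 1)."""
    digits = 1
    while n >= 13:
        n //= 13
        digits += 1
    return digits


def concat13(m, n):
    """m * n, i.e. (m × 13^l(n)) + n."""
    return m * 13 ** length13(n) + n


def concat_n(*xs):
    """(x1 * x2 * ... * xn), associated to the left as in ((x1 * x2) * x3)."""
    return reduce(concat13, xs)


whole = concat_n(6, 1, 4, 11, 7)    # the digits of 614ε7, one at a time
```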

Some Additional Arithmetization Results 

This section continues the work of the previous section in demonstrating that a number of 
important relations are arithmetic, indeed Σ₀. These results will not seem very 
interesting, but they will be used at several junctures in what follows. 

6. Begins. We want a formula that expresses the relation that holds between numbers x 
and y if and only if x is the code for an expression that is the initial segment of the 
expression that y is the code for. For example, since ∀v*0 is an initial segment of 
∀v*0≤v* and 9651 is the code of ∀v*0 and 9651ε65 is the code of ∀v*0≤v*, 9651 
stands in this relation to 9651ε65. We call this the Begins relation. We have already 
defined Pow(x) and Concat₂(x₁, x₂, y) as abbreviations of Σ₀-formulas of arithmetic. 
Consequently, the following formula expresses the Begins relation: 

(x = y ∨ (x ≠ 0 ∧ (∃z ≤ y)(∃w ≤ y)(∃u ≤ y)(Pow(w) ∧ (x • w) = u ∧ 
Concat₂(u, z, y)))) 

For example, 5 begins 5007 because there is a power of 13, namely 100 (which is 13² 
written in base 13), such that 5007 = (5 • 100) * 7. (Think of 5 as x, 100 as w, 500 as 
u, 7 as z and 5007 as y.) Abbreviate this formula thus: xBy 

7. Ends. Similarly, we want a formula that expresses the relation between x and y when 
the numeral for x is a final segment of the numeral for y. That formula is: 

(x = y ∨ (∃z ≤ y)(Concat₂(z, x, y))) 
Abbreviate this formula thus: xEy 

8. Part of. This expresses the relation between x and y when the numeral for x ends 
some numeral that begins the numeral for y. For example, ε2 is part of 5ε294, since 
ε2 ends 5ε2 which begins 5ε294. This relation is expressed by: 

(∃z ≤ y)(xEz ∧ zBy) 

Abbreviate this: xPy 
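
The intended meaning of these three relations can be illustrated in Python by working directly on base 13 numerals as strings. This is an analogy rather than a transcription of the Σ₀-formulas: the function names are hypothetical, the digits a, b, c stand in for η, ε, δ, and the sketch is restricted to positive numbers, so it says nothing about the edge cases involving 0.

```python
DIGITS = "0123456789abc"    # a, b, c stand in for η, ε, δ


def numeral13(n):
    """Base 13 numeral of a positive number n, as a string."""
    out = ""
    while n > 0:
        n, r = divmod(n, 13)
        out = DIGITS[r] + out
    return out


def begins(x, y):
    """The numeral for x is an initial segment of the numeral for y."""
    return numeral13(y).startswith(numeral13(x))


def ends(x, y):
    """The numeral for x is a final segment of the numeral for y."""
    return numeral13(y).endswith(numeral13(x))


def part_of(x, y):
    """The numeral for x ends some numeral that begins the numeral for y."""
    return numeral13(x) in numeral13(y)
```

With these, begins(5, int("5007", 13)) and part_of(int("b2", 13), int("5b294", 13)) both hold, matching the two examples in the text.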

From now on I will simply write xy = z as an abbreviation for Concat₂(x, y, z) (which, 
recall, abbreviates the formula that expresses the relation that holds between three 
numbers x, y and z if and only if x * y = z), and xwy = z as an abbreviation of Concat₃(x, 
w, y, z). 

Also, I will let x₁x₂...xₙPy abbreviate (∃z ≤ y)(Concatₙ(x₁, x₂, ..., xₙ, z) ∧ zPy). 



Exponentiation is Σ₁ 

The relation x^y = z is arithmetic. Indeed, it is Σ₁. What matters for Lesson 8 will be only 
that exponentiation is arithmetic. But in Lesson 10, we will want to know that it is Σ₁ as 
well. 

Toward showing that the relation x^y = z is arithmetic we first need to make an observation 
about it that will give us the hint we need in order to write the formula that expresses it. 

Proposition: x^y = z if and only if there exists a set S of ordered pairs such that: 

(i) ⟨y, z⟩ ∈ S, 

(ii) For every pair ⟨a, b⟩ ∈ S, either ⟨a, b⟩ = ⟨0, 1⟩ or there is some pair ⟨c, d⟩ ∈ S such 
that ⟨a, b⟩ = ⟨c + 1, d • x⟩. 

Proof: 

Left-to-Right: If x^y = z, then we can take S to be the set {⟨0, 1⟩, ⟨1, x⟩, ⟨2, x²⟩, ..., ⟨y, x^y⟩}. 
If we do that, then (i) obviously, ⟨y, z⟩ = ⟨y, x^y⟩ ∈ S, and (ii) for every pair ⟨a, b⟩ ∈ S, either 
⟨a, b⟩ = ⟨0, 1⟩ or there is some pair ⟨c, d⟩ ∈ S such that ⟨a, b⟩ = ⟨c + 1, d • x⟩. (For example, 
⟨2, x²⟩ is not ⟨0, 1⟩, but ⟨1, x⟩ ∈ S and ⟨2, x²⟩ = ⟨1 + 1, x • x⟩.) 

Right-to-Left: Let S be any set of ordered pairs satisfying (i) and (ii). 

We prove, by induction on a, that for all natural numbers a, if there is an integer b such 
that ⟨a, b⟩ ∈ S, then x^a = b. Basis: a = 0. If ⟨a, b⟩ ∈ S and b = 1, then x⁰ = 1 = b. If ⟨a, b⟩ = 
⟨0, b⟩ ∈ S, then it cannot happen that b ≠ 1, because there is no c such that c + 1 = 0. 
Induction hypothesis: Suppose for arbitrary c, if there is an integer d such that ⟨c, d⟩ ∈ S, 
then x^c = d. Induction step: Show that if there is a b such that ⟨c + 1, b⟩ ∈ S, then x^(c+1) = 
b. By the definition of S, if ⟨c + 1, b⟩ ∈ S, then there is a d such that ⟨c, d⟩ ∈ S and ⟨c + 1, 
b⟩ = ⟨c + 1, d • x⟩. By IH, x^c = d. So x^(c+1) = d • x = b. 
So, given that by (i), ⟨y, z⟩ ∈ S, it follows that x^y = z. 

End of proof 
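
The proposition can be illustrated with a small Python sketch. The names witness_set and satisfies_conditions are hypothetical; the first builds the set S used in the left-to-right direction, and the second checks conditions (i) and (ii) for any finite set of pairs.

```python
def witness_set(x, y):
    """S = {(0, 1), (1, x), (2, x²), ..., (y, x^y)}, built by repeated multiplication."""
    S, power = set(), 1
    for k in range(y + 1):
        S.add((k, power))
        power *= x
    return S


def satisfies_conditions(S, x, y, z):
    """(i) (y, z) ∈ S; (ii) every pair is (0, 1) or (c + 1, d • x) for some (c, d) ∈ S."""
    cond_i = (y, z) in S
    cond_ii = all((a, b) == (0, 1)
                  or any((a, b) == (c + 1, d * x) for (c, d) in S)
                  for (a, b) in S)
    return cond_i and cond_ii
```

A set such as {(0, 1), (1, 3), (2, 10)} fails condition (ii) for x = 3, because (2, 10) is neither (0, 1) nor obtainable from (1, 3).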

Now suppose we can find a Σ₀-formula K(y, z, w) that has the following property: For 
any finite sequence of ordered pairs of numbers P = ⟨⟨a₁, b₁⟩, ⟨a₂, b₂⟩, ..., ⟨aₙ, bₙ⟩⟩, there 
is a number w such that, for all numbers y and z, ⟨y, z⟩ is a member of P if and only if 
y ≤ w, z ≤ w and (y, z, w) satisfies K(y, z, w). 

If we can find such a formula K(y, z, w), then for each x and each w, we can think of the 
set of ordered pairs ⟨y, z⟩ such that (y, z, w) satisfies K(y, z, w) as the set S in the above 
proposition. Accordingly, the following Σ-formula expresses the relation x^y = z: 

∃w(K(y, z, w) ∧ (∀a ≤ w)(∀b ≤ w)(K(a, b, w) → 
((a = 0 ∧ b = 1) ∨ (∃c ≤ a)(∃d ≤ b)(K(c, d, w) ∧ a = c + 1 ∧ b = d • x)))) 

Let the formula z = (x Exp y) abbreviate this formula. 

(Quiz: Why does K(y, z, w) have to be Σ₀?) So it remains to find the formula K(y, z, w) 
that does the job. Here we're going to use some tricks. The following formula defines 
the set of codes of terms that are expressed in base 13 notation as strings of 1's: 

x ≠ 0 ∧ (∀y ≤ x)(yPx → 1Py) 

(We stipulate that x is not 0, because as we have defined the formula P, nothing is part of 
0.) Abbreviate this formula thus: Ones(x). 

Where z is (a numeral for a number expressed in base 13 notation as) a string of 1's, say 
that the number denoted by 2z2 is a frame. Say that x is a maximal frame of y if and only 
if x is a frame, the numeral denoting x is a part of the numeral denoting y, and x is as long 
as any frame that is a part of y. For example, the number denoted by 21112 is a maximal 
frame of the number denoted by 5ε03211129252112. The following Σ₀-formula 
expresses the relation of being a maximal frame: 

xPy ∧ (∃z ≤ y)(Ones(z) ∧ x = 2z2 ∧ ¬(∃w ≤ y)(Ones(w) ∧ 2zw2Py)) 

Abbreviate this as xMFy. 

The desired formula K(y, z, w) can be written as follows: 
(∃u ≤ w)(uMFw ∧ uuyuzuuPw ∧ ¬uPy ∧ ¬uPz) 

This does the job, because we can think of w as denoting a sequence of pairs (by analogy, 
not by Gödel-numbering); we can think of uu as separating members of the sequence; 
and we can think of a single u as separating the members of the pairs that are members of 
the sequence. Since u denotes a maximal frame of the number that w denotes, and the 
number that u denotes is not a part of either the number that y denotes or the number that 
z denotes, we can be sure that all of the pairs in the sequence that we are thinking of w as 
denoting are included in this way. In other words, on analogy to the proof of the Proposition 
above, we could prove that z = (x Exp y) if and only if x^y = z. 
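
The frame trick can be tried out in Python on base 13 numerals treated as strings. This is a sketch under assumptions, not the Σ₀-formula itself: encode_pairs is a hypothetical helper that chooses a frame longer than every entry (one convenient way of making sure the frame is not part of any entry), and K checks the same three conditions as the formula by string search.

```python
import re

DIGITS = "0123456789abc"    # a, b, c stand in for η, ε, δ


def numeral13(n):
    """Base 13 numeral of n as a string."""
    if n == 0:
        return "0"
    out = ""
    while n > 0:
        n, r = divmod(n, 13)
        out = DIGITS[r] + out
    return out


def encode_pairs(pairs):
    """Pack a nonempty finite sequence of pairs into a single number w."""
    numerals = [(numeral13(y), numeral13(z)) for y, z in pairs]
    longest = max(len(s) for pair in numerals for s in pair)
    u = "2" + "1" * (longest + 1) + "2"     # frame too long to be part of any entry
    body = "".join(u + u + y + u + z for y, z in numerals) + u + u
    return int(body, 13)


def maximal_frame(w_text):
    """Longest substring of the form 2 1...1 2, or None if there is no frame."""
    runs = re.findall(r"(?=2(1+)2)", w_text)
    return "2" + max(runs, key=len) + "2" if runs else None


def K(y, z, w):
    """uMFw, uuyuzuu is part of w, u is not part of y, u is not part of z."""
    w_text, y_text, z_text = numeral13(w), numeral13(y), numeral13(z)
    u = maximal_frame(w_text)
    return (u is not None
            and u + u + y_text + u + z_text + u + u in w_text
            and u not in y_text and u not in z_text)


pairs = [(0, 1), (1, 2), (2, 4), (3, 8)]    # the set S for x = 2, y = 3
w = encode_pairs(pairs)
```

With this w, K(y, z, w) picks out exactly the four pairs, so w plays the role of the set S for 2³ = 8.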

Diagonalization is Arithmetic 

For any expression, we will define an expression that we will call the diagonal of that 
expression. Likewise, we will define a relation on numbers x and y that holds just in case 
x is the code of an expression and y is the code of the diagonal of that expression. 
Finally, we will show that that relation is arithmetic. 

Let E be an expression (maybe a formula, maybe not). Where v is a particular variable of 
the language (let's say v*), and n is any number, ∀v(v = n → E) is, of course, another 
expression (which is a formula if and only if E is a formula). 

For any expression E, let #(E) be the code of that expression. 

Define a two-place function rep as follows: Where e is the code of E, rep(e, n) is the 
code of ∀v(v = n → E). Call this the representation function. 
For example, E might be the expression 2 ≤ v. Then, since 0′′′ is the numeral that 
denotes the number 3, rep(#(2 ≤ v), 3) = #(∀v(v = 0′′′ → 2 ≤ v)). 

Define a one-place function diag as follows: For all numbers x, diag(x) = rep(x, x). 
The numeral that denotes #(2 ≤ v) is #(2≤v). 

So, for example, diag(#(2 ≤ v)) = rep(#(2 ≤ v), #(2 ≤ v)) = #(∀v(v = #(2≤v) → 2 ≤ v)). 

So, in words, diag(x) is the code for the formula that results from taking the expression E 
for which x is the code, and putting the numeral for that code in place of n in 
∀v(v = n → E). Where n is the numeral that denotes the code for E, ∀v(v = n → E) is 
the diagonal for E. In other words, the diagonal for E is ∀v(v = #E → E), and 
diag(#(E)) is the code for that formula. Be sure to distinguish between the diagonal for 
E, which is an expression (a formula if E is a formula), and diag(#(E)), which is a 
number. 

Suppose we confine our attention to the case in which E is a formula containing v as its 
sole free variable. Suppose also that we interpret ∀v(v = n → E) as saying, "the code 
for n satisfies E". Then the diagonal of E is a sentence that says, "my code 
satisfies me". If we furthermore ignore the distinction between codes and the expressions 
they are codes for, then the diagonal of E in effect says, "I satisfy myself." 

To understand why diag is called the diagonal function, contemplate the following table, 
representing the inputs to rep. rep becomes diag when its inputs are restricted to those on 
the diagonal of this table: 





          y = 1     y = 2     y = 3     ... 

x = 1     ⟨1, 1⟩    ⟨1, 2⟩    ⟨1, 3⟩ 

x = 2     ⟨2, 1⟩    ⟨2, 2⟩    ⟨2, 3⟩ 

x = 3     ⟨3, 1⟩    ⟨3, 2⟩    ⟨3, 3⟩ 

... 

Proposition 2: The function rep is arithmetic. 

Proof: Observe that each of the components of ∀v(v = n̄ → E) has a definite code: 
The code of ∀v(v = is 965265η (written in base 13), the code of n̄ is 10^n (remember 
that point from Lesson 6), the code of → is 8, the code of E is e, and the code of ) is 
3. So the code of ∀v(v = n̄ → E) is (965265η * 10^n * 8 * e * 3). 

So the relation rep(e, n) = y is expressed by the following Σ formula: 

∃w(w = (10 Exp n) ∧ Concat₅(965265η, w, 8, e, 3, y)). 

Proposition 3: The function diag is arithmetic. 

Proof: The relation diag(x) = y is expressed by the following Σ formula: 

∃w(w = (10 Exp x) ∧ Concat₅(965265η, w, 8, x, 3, y)). 

Let Diag(x, y) abbreviate the formula that expresses the relation diag(x) = y. 

Note for later purposes: Since z = (x Exp y) is Σ, we can be sure that Diag(x, y) is Σ and 
therefore that the diag relation is Σ₁ (recursively enumerable). 

Definition: For any set of numbers A, let A* be the set of all n such that diag(n) ∈ A. 
(So if we find the diagonal of a code in A, then we put the code that it is the diagonal of 
in A*. Again ignoring the distinction between codes and formulas, if there is a formula in 
A that says, "I satisfy myself!", then we put the (code for the) expression that is thus said 
to satisfy itself in A*.) 

Lemma 1: If A is arithmetic, then A* is arithmetic. 

Proof: Let A(y) be the formula that expresses A. Then the following formula 
expresses A*: 

∃y(Diag(x, y) ∧ A(y)). 

Definition of Gödel sentences: Where A is a set of numbers, G is a Gödel sentence for A 
if and only if: G is true if and only if the code for (i.e., the Gödel number of) G is in A. 

For short: G is true ⇔ #(G) ∈ A. 

The Gödel Diagonal Lemma (lower): If A is arithmetic, then there is a Gödel sentence 
for A. (This is not the Gödel incompleteness theorem; we're still some distance from 
that. But this is itself one of the great facts of 20th century mathematical logic.) 

Proof: Suppose A is arithmetic. By Lemma 1, A* is arithmetic. Let H(v) be the 
formula that expresses A*. Let h be the code of H(v), and h̄ the numeral denoting h. 
Then: 

∀v(v = h̄ → H(v)) is true if and only if H(h̄) is true (by first-order logic). 
H(h̄) is true if and only if h ∈ A* (because H(v) expresses A*). 
h ∈ A* if and only if diag(h) ∈ A (by the definition of A*). 

So: ∀v(v = h̄ → H(v)) is true if and only if diag(h) ∈ A. 

But diag(h) is the code for ∀v(v = h̄ → H(v)). So ∀v(v = h̄ → H(v)) is a Gödel 
sentence for A. 

I call this the "Lower Diagonal Lemma" (a term I just made up) because it is a 
consequence of a more general theorem that I will introduce later on (Lesson 10), which I 
will call the "Upper Diagonal Lemma". 

Lemma 2: If a set A is arithmetic, then so is its complement (the set of numbers not in A), 
Ā. Proof: The negation of the formula that expresses A expresses Ā. 

The Undefinability of Arithmetic (Tarski's Undefinability Theorem) (drum roll, trumpets): 
The set of codes of formulas in 𝒩 is not arithmetic. 

In other words, the set of Gödel numbers of the true formulas of L_A is not arithmetic. 
Proof: 

Let T be the set of codes of true formulas of L_A. 

Suppose, for a reductio, that T is arithmetic. 

Then by Lemma 2, the complement of T, T̄, is arithmetic too. 

(T̄ comprises the codes of expressions of L_A that are not formulas and the codes of 
formulas of L_A that are not true.) 

By the Gödel Diagonal Lemma, there is a Gödel sentence G for T̄. 
Since G is a Gödel sentence for T̄, G is true if and only if #(G) ∈ T̄. 
Since G is a formula, #(G) ∈ T̄ if and only if G is not true. 
So G is true if and only if it is not true. Contradiction! 



Lesson 8: Arithmetization of Syntax and the First 
Incompleteness Theorem 



In this lesson I will define a theory of arithmetic and demonstrate that the set of codes of 
sentences that are provable in that theory is Σ₁ (and therefore arithmetic). Gödel's first 
incompleteness theorem is just a short step beyond that. 

Recall that by a theory I just mean a set of formulas. (Formerly I have said that a theory 
was a set of sentences, with no free variables. But now that we understand that formulas can 
be true, if they are satisfied by every variable assignment, we can allow that theories 
include formulas with free variables.) Smullyan uses the term "system" where I say 
"theory". Other people, e.g., Boolos and Jeffrey, use the term "theory" to mean a set of 
formulas closed under logical consequence. (So if A is a theory and A ⊢ Q, then Q ∈ A.) 
But I will not assume that every theory is closed under logical consequence. 

If Th is a theory in some language and Q is a formula of that language, we will say that 
there is a proof of Q in Th if and only if there exists a finite sequence of formulas (in the 
pertinent language) containing Q such that every member of the sequence is either an axiom 
of QL (remember that concept from Lesson 5) or a member of Th or follows from previous lines 
by Modus Ponens or Generalization. 

If there is a proof of Q in Th, then we will say that Q is a theorem of Th. (This is often 
symbolized thus: ⊢_Th Q.) The set comprising the theorems of a theory Th is designated 
thus: Con(Th). (It does not matter now whether we think of consequence as semantic 
consequence or syntactic consequence; for given soundness and completeness, which 
we proved, but only for a different system, these two concepts of consequence are 
coextensive.) 

If Th is an effectively decidable set of formulas (so that, by Church's thesis, both the set 
of codes of Th and the set of codes of the complement of Th are Σ₁), then Th is an axiom 
system and the members of Th together with the axioms of QL are axioms. (Note well: It 
can happen that Th is decidable although Con(Th) is not.) 

Peano Arithmetic 

Peano arithmetic (after Giuseppe Peano, although he did not invent it), or P.A., for short, 
is a theory in the language of arithmetic that consists of nine specific formulas and also 
all of the infinitely many formulas that fit the form of one particular axiom scheme. This 
theory qualifies as an axiom system because there is an algorithm for deciding whether or 
not a formula is a member of the theory: Check whether it is one of the nine specific 
formulas or whether it has the form of the axiom scheme. 



The nine axioms of P. A. 



N1    (x' = y' → x = y) 
N2    ¬0 = x' 
N3    (x + 0) = x 
N4    (x + y') = (x + y)' 
N5    (x · 0) = 0 
N6    (x · y') = ((x · y) + x) 
N7    (x ≤ 0 ↔ x = 0) 
N8    (x ≤ y' ↔ (x ≤ y ∨ x = y')) 
N9    (x ≤ y ∨ y ≤ x) 



Think of x as an abbreviation of v* and think of y as an abbreviation of v**. If you look 
at these nine formulas, you will readily recognize that they are all true. That is, they are 
all satisfied by every variable assignment in the intended interpretation of the language of 
arithmetic. 

The induction scheme 

Where F is any formula (in the language of arithmetic) (containing perhaps variable x 
free, as well as perhaps other free variables), and v is any variable that does not occur in 
F, let F_v[y] (notice the square brackets) abbreviate a formula of the following form: 

∀v(v = y → ∀x(x = v → F)) 

(So the subscript on F_v[y] indicates a variable that does not occur in F.) Then every 
formula of the following form is an axiom of P.A.: 

N10: (F_v[0] → (∀x(F → F_v[x']) → ∀xF)) 

Dispensing with the abbreviation, we can write out N10 in full as follows (remember that 
v is not in F at all): 

(∀v(v = 0 → ∀x(x = v → F)) → (∀x(F → ∀v(v = x' → ∀x(x = v → F))) → ∀xF)). 

You can recognize this as stating a principle of induction as follows: 

∀v(v = 0 → ∀x(x = v → F)): This is equivalent to ∀x(x = 0 → F) and says that 0 
satisfies F. 

∀x(F → ∀v(v = x' → ∀x(x = v → F))): This says that if x satisfies F, then the successor 
of x satisfies F. (Notice that the second occurrence of "∀x" "cuts off" the first one, so 
that only the occurrence of "x" in "x'" is in the scope of the first occurrence of "∀x".) 

∀xF: This says, of course, that everything satisfies F. 

Good question: Why don't we just use the following as the axiom scheme? 
(F0/x → (∀x(F → Fx'/x) → ∀xFx)) 

Answer: Because the more complicated one is easier to arithmetize. That is, it is easier 
to show that the set of codes of such formulas is expressible by a formula in the language 
of arithmetic. 
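
To see why the substitution-free form is convenient, here is a minimal Python sketch that builds an instance of N10 from a formula F by plain string concatenation, with no substitution into F at all. The function names and the default variable names "v" and "x" are my own assumptions, and the check that v does not occur in F is a crude substring test.

```python
def f_sub(F, v, y, x="x"):
    """F_v[y]: the formula ∀v(v = y → ∀x(x = v → F)), built by concatenation."""
    return f"∀{v}({v} = {y} → ∀{x}({x} = {v} → {F}))"

def induction_instance(F, v="v", x="x"):
    """The instance of N10 for F: (F_v[0] → (∀x(F → F_v[x']) → ∀xF))."""
    if v in F:
        raise ValueError("v must not occur in F")
    base = f_sub(F, v, "0", x)
    step = f_sub(F, v, x + "'", x)
    return f"({base} → (∀{x}({F} → {step}) → ∀{x}{F}))"

print(induction_instance("(x + 0) = x"))
```

Because F is only ever copied, never altered, the corresponding operation on codes is just concatenation of fixed digits with the code of F. That is what makes this scheme easy to arithmetize.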

The Arithmetization of Proof 

Our objective is to find a formula of the language of arithmetic that expresses the set of 
codes of formulas that are provable in P.A. We will do this by defining a series of 
abbreviations of formulas that will be used in building up the target formula. We began 
the necessary series of definitions in the previous lesson, in the definitions of Pow(w) (w 
is a power of 13), xy = z (concatenation of two expressions), x₁x₂...xₙ = z 
(concatenation of n expressions), xBy (begins), xEy (ends), xPy (part of) and 
x₁x₂...xₙPy. 

1. Sequences. Recall that the symbol # is used to represent sequences of formulas and 
that it has Gödel number δ (the base 13 digit for the number twelve). We want to find 
a formula that expresses the fact that x is the code for a sequence, not the code for a 
formula or other expression. Let K₁₁ be the set of codes whose base 13 numerals do 
not contain δ. The members of K₁₁ are codes of expressions that are not sequences. 
If (n₁, n₂, ..., n_m) is a sequence of numbers in K₁₁, then δn₁δn₂δ...δn_mδ (i.e., the 
number that is written this way in base 13, substituting the numeral for n₁ for "n₁", 
etc.) is a sequence number for the sequence (n₁, n₂, ..., n_m). The formula that 
expresses the set of sequence numbers x is: 

(δBx ∧ δEx ∧ δ ≠ x ∧ ¬δδPx ∧ (∀y ≤ x)(δ0yPx → δBy)) 



Abbreviate this: Seq(x) 

To understand the last conjunct of Seq(x), recall, from Lesson 5, that, while 0 is the 
Gödel number of the prime symbol, ', we do not assign Gödel numbers to expressions 
of more than one symbol that begin with a prime. So a sequence can contain the 
expression #'# but not, for example, #''¬v#. So the number δ0δ can be part of a 
sequence number, but the number δ0076δ cannot be part of a sequence number. The 
last conjunct in Seq(x) secures that result. 
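
The conditions in Seq(x) can be checked mechanically on base 13 numerals. The following Python sketch is only an illustration: it writes the digits ten, eleven and twelve as η, ε and δ, treats "part of" as the substring relation on numerals, and uses helper names (numeral, value, seq_number, is_seq) that are my own.

```python
DIGITS = "0123456789ηεδ"  # base-13 digits; η = ten, ε = eleven, δ = twelve

def numeral(z):
    """Base-13 numeral for the number z, as a string."""
    s = DIGITS[z % 13]
    while z >= 13:
        z //= 13
        s = DIGITS[z % 13] + s
    return s

def value(s):
    """The number denoted by a base-13 numeral."""
    n = 0
    for ch in s:
        n = n * 13 + DIGITS.index(ch)
    return n

def seq_number(members):
    """The number written δn1δn2δ...δnmδ in base 13."""
    return value("δ" + "".join(numeral(m) + "δ" for m in members))

def is_seq(x):
    """The five conjuncts of Seq(x), read off the numeral for x."""
    s = numeral(x)
    return (s.startswith("δ") and s.endswith("δ") and s != "δ"
            and "δδ" not in s
            # last conjunct: whatever follows an occurrence of δ0 must begin with δ
            and all(s[i + 2] == "δ" for i in range(len(s) - 2) if s[i:i + 2] == "δ0"))

print(is_seq(value("δ0δ")))     # the sequence containing just the prime symbol
print(is_seq(value("δ0076δ")))  # ruled out by the last conjunct
print(numeral(seq_number([value("65"), 1, 13])))
```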

2. Membership in a sequence. There is a relation that holds between x and y if and only 
if y is a code for a sequence (a sequence number) and x is the code for a member of 
that sequence. The following formula expresses that relation: 

(Seq(y) ∧ δxδPy ∧ ¬δPx) 
Abbreviate this: x In y 

From now on, (∀x In y) ... abbreviates (∀x ≤ y)(x In y → ...). Notice that this makes 
(∀x In y) a bounded quantifier. 
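
Here is a small Python sketch of the membership relation and of the bounded quantifier over members. As before, it is only an illustration under my own conventions: the digits ten, eleven and twelve are written η, ε and δ, "part of" is the substring relation on base 13 numerals, and the function names are assumptions.

```python
DIGITS = "0123456789ηεδ"  # base-13 digits; δ (twelve) is the code of #

def numeral(z):
    s = DIGITS[z % 13]
    while z >= 13:
        z //= 13
        s = DIGITS[z % 13] + s
    return s

def value(s):
    n = 0
    for ch in s:
        n = n * 13 + DIGITS.index(ch)
    return n

def is_seq(y):
    s = numeral(y)
    return (s.startswith("δ") and s.endswith("δ") and s != "δ"
            and "δδ" not in s
            and all(s[i + 2] == "δ" for i in range(len(s) - 2) if s[i:i + 2] == "δ0"))

def is_in(x, y):
    """x In y: Seq(y), and δxδ is part of y, and δ is not part of x."""
    sx, sy = numeral(x), numeral(y)
    return is_seq(y) and ("δ" + sx + "δ") in sy and "δ" not in sx

def members(y):
    """The members of the sequence coded by y (assumes is_seq(y))."""
    return [value(part) for part in numeral(y).strip("δ").split("δ")]

y = value("δ65δ1δ10δ")                       # codes the sequence (v*, 0, 0')
print(is_in(value("65"), y), is_in(5, y))    # 5 is a part of y but not a member
print(all(is_in(m, y) for m in members(y)))  # a check in the style of (∀x In y)
```

The third conjunct matters: without it, the number written 65δ1 would count as a member of y, since δ65δ1δ is part of the numeral for y.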

3. Earlier in a sequence. We want a formula that expresses the relation between x, y 
and z when z is the sequence number for a sequence and the expression for which x is 
the code is earlier in that sequence than the expression for which y is the code: 

(x In z ∧ y In z ∧ (∃w ≤ z)(wBz ∧ x In w ∧ ¬ y In w)) 

Abbreviate this: x ≺_z y (with z written as a subscript on ≺). 

From now on, (∃z,w ≺_x y) ... abbreviates ∃z∃w(z ≺_x y ∧ w ≺_x y ∧ ...). 

4. Formation rules: We want a formula that expresses the relation between x, y and z 
that holds when x and y are the codes of expressions E_x and E_y and z is the code for 
the expression (E_x → E_y). The following formula expresses that relation: 

2x8y3 = z 

(Remember that 2 is the code for (, 8 the code for →, and 3 the code for ).) 
Abbreviate this as x imp y = z. Similarly, the relations between x, y and z when z is 
the code for (E_x + E_y), (E_x · E_y), E_x = E_y, or E_x ≤ E_y can be expressed by formulas 
that we abbreviate as x pl y = z, x tim y = z, x id y = z, and x le y = z, respectively. 
And the relations that hold between x and y when y is the code for ¬E_x or E_x' can be 
expressed by formulas that we abbreviate as neg(x) = y and s(x) = y, respectively. 
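
The formation relations are all just concatenations of codes with fixed digits. The Python sketch below illustrates a few of them. It assumes the digits used in this lesson (2 for (, 8 for →, 3 for ), 0 for the prime, the digit ten, written η, for =) and, as my own reading of the example in item 1, the digit 7 for the negation sign. The function names are hypothetical.

```python
DIGITS = "0123456789ηεδ"  # base-13 digits; η = ten, ε = eleven, δ = twelve

def numeral(z):
    s = DIGITS[z % 13]
    while z >= 13:
        z //= 13
        s = DIGITS[z % 13] + s
    return s

def value(s):
    n = 0
    for ch in s:
        n = n * 13 + DIGITS.index(ch)
    return n

def concat(*parts):
    """Concatenate base-13 numerals; a part is either a number or a string of digits."""
    return value("".join(p if isinstance(p, str) else numeral(p) for p in parts))

def imp(x, y):
    return concat("2", x, "8", y, "3")   # code of (E_x → E_y)

def neg(x):
    return concat("7", x)                # code of ¬E_x (digit 7 assumed)

def s(x):
    return concat(x, "0")                # code of E_x'

def ident(x, y):
    return concat(x, "η", y)             # code of E_x = E_y

a = ident(1, 1)      # 0 = 0
b = ident(13, 13)    # 0' = 0'
print(numeral(imp(a, b)))
print(numeral(neg(a)), s(1))
```

So "x imp y = z" holds of a, b and a third number exactly when that number is imp(a, b), and similarly for the other relations.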

From now on, I first give the abbreviation, I then state in English the relation to be 
expressed, and I then give the formula that expresses it. 

5. St(x): x is the code of a string of stars (asterisks): (∀y ≤ x)(yPx → 5Py) 

6. Var(x): x is the code for a variable: (∃y ≤ x)(St(y) ∧ 6y = x) 

7. Num(x): x is the code of a numeral. We already have an abbreviation for this, 
Pow(x), but in this context it might be helpful to use a different mnemonic. (Recall 
that the numerals have as their codes, 1, 10, 100, 1000, etc. (writing in base 13).) 

Observe that we can represent the grammatical construction of a term as a sequence that 
starts with variables or numerals or some of both, forms terms from them, forms terms 
from those terms, and so on. For example, we can represent the construction of the term 
((x • 0') + y) as follows: (0', x, y, (x • 0'), ((x • 0') + y), ((x • 0') + y)'). 

8. TermOp(x, y, z): z is the code for an expression that results from applying one of 
the basic term-forming operations to the expressions for which x and y are codes: 

(x pl y = z ∨ x tim y = z ∨ s(x) = z) 

9. TermSeq(x): x is the sequence number for a sequence of numbers representing the 
formation of a term: 

(Seq(x) ∧ (∀y In x)(Var(y) ∨ Num(y) ∨ (∃z,w ≺_x y)TermOp(z, w, y))) 

For example, the code for the following expression will satisfy TermSeq(x): 
#x#0'#f*(x0')#0''#f**(0''f*(x0'))# 

10. Term(x): x is the code for a term: ∃y(TermSeq(y) ∧ x In y). 
Notice that this is a Σ₁-formula, but not a Σ₀-formula. 

11. Atom(x): x is the code of an atomic formula: 

(∃y ≤ x)(∃z ≤ x)(Term(y) ∧ Term(z) ∧ (y id z = x ∨ y le z = x)) 

12. Gen(x, y): y is the code of a universal quantification of the formula whose code is 
x: (∃z ≤ y)(Var(z) ∧ 9zx = y). Notice that this doubles as a formation rule and an 
inference rule in our system. 

At this point, observe that formulas can be built up from other formulas by any of three 
formula-building operations: Adding a negation sign, inserting an arrow between two 
formulas and putting parentheses on the outside, or adding a universal quantifier. For 
example, we can represent the construction of the formula ∀x(Fx → Gx) as follows: 
(Fx, Gx, (Fx → Gx), ∀x(Fx → Gx)) 

13. FormOp(x, y, z): z is the code for an expression that results from applying one of 
the three formula-forming operations to the expressions for which x and y are codes: 
x imp y = z ∨ neg(x) = z ∨ Gen(x, z) 

14. FormSeq(x): x is the sequence number for a sequence of codes representing the 
formation of a formula: 

(Seq(x) ∧ (∀y In x)(Atom(y) ∨ (∃z,w ≺_x y)FormOp(z, w, y))) 

For example, the code for the following expression will satisfy FormSeq(x): 
#v*=0'#v*=0''#¬v*=0''#(v*=0'→v*=0'')# 

15. Form(x): x is the code for a formula in the language of arithmetic: 
∃y(FormSeq(y) ∧ x In y) 

Notice that this is a Σ₁-formula, but not a Σ₀-formula. 

16. Ax(x): x is the code of an axiom . . . Let's skip this for the moment and come back to 
it later. 

17. x imp z = y: We have already defined this, back at step 4, but I mention it again, 
because it also serves to define the relation: z is the code for an expression derivable 
by Modus Ponens from the expressions for which x and y are the codes. 

18. Der(x, y, z): z is the code for an expression that is derivable by either Modus 
Ponens or Generalization from the expressions for which x and y are codes: 

x imp z = y ∨ Gen(x, z) 

19. Pf(x): x is the sequence number of a proof in Peano arithmetic: 
(Seq(x) ∧ (∀y In x)(Ax(y) ∨ (∃z,w ≺_x y)Der(z, w, y))) 

20. Prov(x): x is the code of a formula provable in P.A.: ∃y(Pf(y) ∧ x In y). 

Ta da! Prov(x) is a formula that expresses the set whose members are all and only the 
codes of formulas that are theorems of Peano arithmetic. (It is a Σ₁-formula, but not a 
Σ₀-formula.) And if you want, by cashing in all the abbreviations, you can even write it out 
(but there are better ways to spend your time). Except for one thing. . . We still have not 
done item #16 above. We need to show that there is a formula that expresses the set of 
axioms (the usual axioms of QL and the special axioms of Peano arithmetic). 
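
The shape of Pf(x) is easier to see if we forget about codes for a moment and work directly with strings. The following Python sketch checks whether a list of formulas is a proof relative to a given test for axiomhood. It is only an illustration: the toy axioms "A" and "(A → B)", the fixed list of variables, and the function name is_proof are all assumptions of mine, not part of P.A.

```python
def is_proof(lines, is_axiom, variables=("x", "y", "v")):
    """Every line is an axiom or follows from earlier lines by MP or Generalization."""
    for i, line in enumerate(lines):
        earlier = lines[:i]
        # Modus Ponens: some earlier a and some earlier (a → line), cf. "x imp z = y"
        by_mp = any(f"({a} → {line})" in earlier for a in earlier)
        # Generalization: line is ∀v followed by some earlier line, cf. Gen(x, z)
        by_gen = any(line == f"∀{v}{a}" for a in earlier for v in variables)
        if not (is_axiom(line) or by_mp or by_gen):
            return False
    return True

toy_axioms = {"A", "(A → B)"}   # hypothetical axioms, for illustration only
print(is_proof(["A", "(A → B)", "B", "∀xB"], toy_axioms.__contains__))
print(is_proof(["B"], toy_axioms.__contains__))
```

Checking a given sequence is a bounded task, like Pf(x). Prov(x), by contrast, asks whether some proof exists, and the search for one has no bound; that is where the unbounded existential quantifier comes from.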

In Lesson 5, we encountered the seven axiom schemes of QL. In this lesson we have 
encountered the nine axioms of Peano arithmetic and the induction scheme of Peano 
arithmetic. This gives us seventeen axioms or axiom schemes. Suppose L1(x) is a 
formula that expresses the set of numbers x such that x is the code of an axiom of type 
L1, and L2(x) is a formula that expresses the set of numbers x such that x is the code of 
an axiom of type L2, . . ., and N1(x) is a formula that expresses the set whose sole 
member is the code for N1, and N2(x) is a formula that expresses the set whose sole 
member is the code for N2, . . ., and, finally, N10(x) is a formula that expresses the 
set whose members are codes of formulas of the type N10 (the induction scheme). Then 
a formula that expresses the set of numbers x such that x is the code for an axiom is: 

(L1(x) ∨ L2(x) ∨ ... ∨ N1(x) ∨ N2(x) ∨ ... ∨ N10(x)) 

This is the formula that we abbreviate as Ax(x). I won't bother to write out all seventeen 
of these disjuncts, but here are a few examples, including the hard cases: 

L1(x) is: (∃y ≤ x)(∃w ≤ x)(∃z ≤ x)(Form(x) ∧ w imp y = z ∧ y imp z = x). 

L7(z): For this one, observe that an axiom of this form can be written in the form 
(v = t → (X₁vX₂ → X₁tX₂)). (So, in X₁vX₂, X₁ is the part of the formula that comes 
before v, and X₂ is the part of the formula that comes after v.) Then the formula that 
expresses the codes of axioms of type L7 can be expressed thus: 

(∃u ≤ z)(∃t ≤ z)(∃p ≤ z)(∃q ≤ z)(∃x ≤ z)(∃y ≤ z)(∃w ≤ z)(Var(u) ∧ Term(t) ∧ 
Form(p) ∧ Form(q) ∧ xuy = p ∧ xty = q ∧ p imp q = w ∧ 2uηt8w3 = z). 

N1(x): Let n̄₁ be the numeral that denotes the code of axiom N1. Then the formula that 
expresses the set containing the code of axiom N1 is simply: x = n̄₁. 

N10(z): First we identify a formula Ef(v, y, f, w) that expresses the set of quadruples of 
codes representing variables v and y and formulas F and formulas of form ∀v(v = y → 
∀x(x = v → F)), respectively, thus: 

(∃x ≤ w)(Var(v) ∧ Var(y) ∧ Var(x) ∧ Form(f) ∧ ¬vPf ∧ 9v2vηy89x2xηv8f33 = w). 

Then the formula that N10(z) abbreviates (satisfied by the code for any instance of the 
induction scheme) is as follows: 

(∃v ≤ z)(∃x ≤ z)(∃f ≤ z)(∃w₁ ≤ z)(∃w₂ ≤ z) 
(Ef(v, 0, f, w₁) ∧ Ef(v, x', f, w₂) ∧ 2w₁829x2f8w₂389xf33 = z). 

Thus, we have proved: 

The Arithmetization of Proof: The set of codes of provable formulas (provable in P.A.) is 
arithmetic. 

This is so, because Prov(x) is a formula of L_A. Indeed, we can say something stronger: 
The set of codes of provable formulas (provable in P.A.) is a Σ₁ set, because Prov(x) is a 
Σ-formula. That is evident because Prov(x) is ∃y(Pf(y) ∧ x In y), and (Pf(y) ∧ x In y) is 
Σ. To see that (Pf(y) ∧ x In y) is Σ, review the construction and note that we start with Σ₀ 
formulas and compose new formulas only in ways that conform to the definition of a 
Σ-formula. Since Prov(x) is a Σ-formula, (by Smullyan's theorem) the set of codes of 
provable formulas is Σ₁, i.e., recursively enumerable, and the set of provable formulas is 
effectively enumerable. 

Gödel's First Incompleteness Theorem (first formulation): There are true formulas in L_A 
that are not theorems of P.A. 

First Proof (using the Undefinability of Arithmetic): 

By the Undefinability of Arithmetic, the set of codes of truths of arithmetic is not 
arithmetic. We have seen that the set of codes of theorems of P.A. is Σ₁, and 
therefore, certainly, arithmetic. So, where 𝒩 is the set of truths of arithmetic, 𝒩 ≠ 
Con(P.A.). So at least one of two cases must obtain. 

Case 1: There is a formula of arithmetic Q such that Q ∈ Con(P.A.) and Q ∉ 𝒩. 
But presumably all of the theorems of P.A. are true. So this can't be right. We are 
left with: 

Case 2: There is a formula of arithmetic Q such that Q ∉ Con(P.A.) and Q ∈ 𝒩. Q 
is a truth in L_A that is not a theorem of P.A. 

Second Proof (applying Gödel's Diagonal Lemma to the set of codes of unprovable 
formulas): 

Let P be the set of Gödel numbers of theorems of P.A. and P̄ be the complement of 
that set, viz., the set comprising the codes of expressions that are not formulas and the 
codes of formulas in the language of arithmetic that are not theorems of P.A. 

The formula Prov(x) defined above expresses P. So the formula ¬Prov(x) 
expresses P̄. So P̄ is arithmetic too. So by the Gödel Diagonal Lemma, there is a 
Gödel sentence G for P̄. So G is true if and only if #(G) ∈ P̄. 

Case 1: G is false and #(G) ∉ P̄. In that case, #(G) ∈ P, which means that G is a 
theorem of P.A. But presumably, no falsehoods are theorems of P.A. So we are left 
with: 

Case 2: G is true and #(G) ∈ P̄, which means that G is not provable in P.A. So 
some truths of arithmetic are not provable in P.A. 

Note: In this second proof, there is no mystery about what G says. If you wished, you 
could write it out. P̄ is arithmetic; so by Lemma 1 from Lesson 7, P̄* is arithmetic. The 
formula that expresses P̄* is ∃y(Diag(x, y) ∧ ¬Prov(y)). Abbreviate this as K(x). Let k 
be the code of K(x), and k̄ the numeral denoting k. Then the Gödel sentence G for P̄ is 
∀x(x = k̄ → K(x)). (See the proof of the Gödel Diagonal Lemma.) Since G is true if 
and only if its code belongs to the set of codes of expressions that are not formulas and 
formulas that are not provable, popular expositions of Gödel's theorem often report that 
Gödel finds a sentence that says, "I am not provable". But if you think carefully about 
the meaning of K(x), it's actually not very easy to think of G as saying that, for K(x) does 
not express P̄ but P̄*. Ignoring the difference between ∀x(x = k̄ → K(x)) and K(k̄), 
what G says is something more like this: "My code is in the set of codes of formulas the 
codes of whose diagonals are in the set of codes of expressions that either are not codes 
of formulas or are codes of formulas that are not provable." or, more briefly, "The 
diagonal of my code is not the code of a provable formula." 

Note also: We have proved that G really is true (assuming that P.A. is true)! (How can 
we know this if we cannot derive it from P.A.?) 

Why is this theorem called Gödel's incompleteness theorem? 

Definition: Say that a theory in a language is complete if and only if for every sentence 
in that language either it or its negation is a theorem of the theory. (We do not require 
that every formula or its negation be true. For example, we cannot assume that either 
x = 5 or x ≠ 5 is true. That would be like assuming that either ∀x x = 5 or ∀x x ≠ 5 is 
true.) (Obviously, this is a different sense of "complete" than we speak of in saying that our 
deductive calculus is "complete". We did, however, encounter this notion of 
completeness in Lesson 3.) 

Gödel's First Incompleteness Theorem (second formulation): P.A. is incomplete. 

Proof: G, in the second proof above, is a sentence. As we have seen, G is true and 
not a theorem of P.A. But every theorem of P.A. is true; so if ¬G is a theorem of 
P.A., then ¬G is true, which means that G is false, contrary to what we have seen. So 
¬G is not a theorem of P.A. either. So neither G nor ¬G is a theorem of P.A. So 
P.A. is incomplete. 

We have just seen that the first formulation can be strengthened to "There are true 
sentences of arithmetic that are not theorems of P.A." and that this implies the second 
formulation. The second formulation implies this strengthening of the first formulation, 
because 𝒩 itself is a complete theory: Since 𝒩 is complete and P.A. is incomplete, there 
are sentences in 𝒩 that are not theorems of P.A. 

We still have not formulated Gödel's first incompleteness theorem in the manner in 
which nowadays it is most commonly formulated. We will be able to do that after we 
give the following definition: 

Definition: A set of sentences S is correctly axiomatizable if and only if there is a decidable 
set A of true formulas such that all members of S are theorems of A. When S is correctly 
axiomatizable in this way, we call the members of the decidable set A the nonlogical axioms 
(i.e., other than those in QL). 

In light of this definition, what we have shown is the following: 

Gödel's First Incompleteness Theorem (third formulation): The set of truths of 
arithmetic is not correctly axiomatizable. 

Proof: Suppose that 𝒩 is correctly axiomatizable. Then there is a decidable set of 
true formulas Ax such that every member of 𝒩 is a theorem of Ax. By Church's 
thesis, the set of codes of members of Ax is recursively decidable. In that case, we 
can find a Σ₁-formula Ax(x) that expresses that set and prove, just as we did above in 
the Arithmetization of Proof for P.A., that the set of theorems of Ax is Σ₁ and 
therefore arithmetic. So by the proofs of either of the other formulations, not every 
member of 𝒩 is provable in Ax, contrary to assumption. 

Note: There is reason not to be satisfied with these proofs, namely, that we have had to 
assume that the theorems of P.A. are all true. Kurt Gödel's own original proof (published 
in German in 1931) did not assume this, in fact. All he assumed was that the theory was 
ω-consistent, which means that for each formula F, if for all n, F(n̄) is true, then ∀vF(v) 
is true. (True theories need not in general be ω-consistent, since not every object has a 
name; but every natural number has a name; so one might expect that a theory of 
arithmetic would be ω-consistent.) So there is a place for us to try to prove something 
still more general. We will do that, in fact, as a short detour (in Lesson 10) on our way to 
proving the undecidability of first-order logic. This more general theorem will not take 
for granted the truth of a theory of arithmetic or even its ω-consistency, but only its 
consistency. 

The Enumerability of Nonformulas 

Now, while we are thinking about the recursive enumerability of formulas, having shown 
that the set of codes of formulas is Σ₁, I want to show also that the set of codes of strings 
of symbols that are not formulas is also recursively enumerable, i.e., Σ₁. There is a small 
reason to do that now, and a larger reason to have that result on board for use in proving 
the more general form of Gödel's First Incompleteness Theorem (in Lesson 10). 

The small reason to prove now that the set of codes of nonformulas is enumerable is that 
we still cannot quite show that Con(P.A.) is correctly axiomatizable. The interest of 
Gödel's incompleteness theorem, in the third formulation above, would seem to be 
somewhat diminished if not even Con(P.A.) is correctly axiomatizable. The problem is 
that we do not yet have a proof that the set of axioms of P.A. is decidable. Intuitively, 
that set is decidable (just see whether a given sentence has the form of one of the axiom 
schemata or not). But we would like to have an honest proof. We have seen that the set 
of codes of axioms is Σ₁, since Ax(x) is Σ. So it would suffice to show that the 
complement of the set of codes of axioms is Σ₁ too. That would show that the set of 
codes of axioms is recursively decidable. 

Inspection of the definition of Ax(x) reveals that the only occurrences of unbounded 
quantifiers are in the formulas Term(x) and Form(x). (The first of these formulas is 
∃y(TermSeq(y) ∧ x In y); the second is ∃y(FormSeq(y) ∧ x In y).) So our challenge 
is to find equivalent formulas (i.e., expressing the same sets of numbers) containing only 
bounded quantifiers. Here I follow the argument on pp. 53-54 of Smullyan. 

Definition: π(x) = 13^((x × x) + x + 1) 

Our interest in this strange function is that it will provide the bound that we can add to 
our existential quantifiers. 

Recall from item 1 above (concerning sequences) that K₁₁ is the set of codes whose base 
13 numerals do not contain δ. That is, members of K₁₁ are not codes for sequences, only 
codes of strings of symbols in the language of arithmetic. 

Theorem: Suppose (a₁, . . . , a_k) is a sequence of numbers in K₁₁, and choose n such that 
k ≤ n and for all i ≤ k, we have a_i ≤ n. Let x = δa₁δ . . . δa_kδ. Then x ≤ π(n). 

Proof: Let y = δnδn . . . δnδ. 

In other words, y is a number whose numeral in base 13 consists of the numeral δ 
followed by n occurrences of nδ. 

It is evident that x = δa₁δ . . . δa_kδ ≤ δnδn . . . δnδ = y. So to show that x ≤ π(n), it will 
suffice to show that y ≤ π(n). 

For any number z, let L(z) be the length of the base 13 numeral denoting z. There are n 
occurrences of the numeral for n in the base 13 numeral for y, and there are n + 1 
occurrences of δ in that numeral. So L(y) = (n × L(n)) + n + 1. But the length of the 
numeral for a number is never greater than the number. So L(n) ≤ n. So L(y) ≤ (n × n) + n + 
1. Moreover, if we take the length of a numeral for a number and raise the base to the 
power of that length, the result is always a larger number. (For example, writing in base 
ten, L(967) = 3, and 10³ = 1000.) So y < 13^L(y). So y < 13^((n × n) + n + 1) = π(n). 

End of Proof 
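
The bound can be spot-checked by brute force for small n. The Python sketch below does this; it is only a sanity check under my own conventions (the helper names are hypothetical, and the digit twelve stands for δ), not a substitute for the proof.

```python
from itertools import product

BASE = 13
DELTA = 12  # the base-13 digit used as the sequence marker

def pi(n):
    """The bounding function: 13 raised to the power (n × n) + n + 1."""
    return BASE ** (n * n + n + 1)

def digits(z):
    """Base-13 digits of z, most significant first."""
    out = [z % BASE]
    while z >= BASE:
        z //= BASE
        out.append(z % BASE)
    return out[::-1]

def seq_number(members):
    """The number whose base-13 numeral is δa1δa2δ...δakδ."""
    ds = [DELTA]
    for a in members:
        ds += digits(a) + [DELTA]
    x = 0
    for d in ds:
        x = x * BASE + d
    return x

def bound_holds(n):
    """Check x ≤ π(n) for every sequence of length k ≤ n with members between 1 and n."""
    return all(seq_number(seq) <= pi(n)
               for k in range(1, n + 1)
               for seq in product(range(1, n + 1), repeat=k))

print([bound_holds(n) for n in (1, 2, 3)])
```

For n up to 11 the numbers 1, ..., n are single base 13 digits other than δ, so they are all in the set of codes that the Theorem is about.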

Recall that Term(x) is defined as ∃y(TermSeq(y) ∧ x In y) (item 10 above), where 
(TermSeq(y) ∧ x In y) is Σ₀. We now want to show (i) that the formula 
(∃y ≤ π(x))(TermSeq(y) ∧ x In y) expresses this set as well, and (ii) that this set is 
recursive (both it and its complement are Σ₁). 

(i) (∃y ≤ π(x))(TermSeq(y) ∧ x In y) expresses the same set as Term(x): 

Obviously, every member of the set expressed by (∃y ≤ π(x))(TermSeq(y) ∧ x In y) 
is in the set expressed by Term(x). The challenge is to show the converse. 

For a given term t, suppose s is a shortest sequence (there might be several, all 
equally short) such that (#(t), #(s)) is a member of the set of pairs expressed by 
(TermSeq(y) ∧ x In y). 

For some a₁, . . ., a_k, the code for s is δa₁δ . . . δa_kδ. Since s is a shortest such 
sequence, a_k is the code of t. 

Moreover, the code for an expression is never less than the number of term-forming 
operations required to construct the expression. So k ≤ a_k. And for all i ≤ k, we have 
a_i ≤ a_k. So by the Theorem, δa₁δ . . . δa_kδ ≤ π(a_k). So (#(t), #(s)) is a member of the 
set of pairs expressed by y ≤ π(x) as well. So #(t) is a member of the set expressed by 
(∃y ≤ π(x))(TermSeq(y) ∧ x In y). 

(ii) Both (∃y ≤ π(x))(TermSeq(y) ∧ x In y) and its negation express Σ₁ sets (so that the 
set expressed is recursive): 

This formula is obviously equivalent to 

∃z(∃y ≤ z)(z = π(x) ∧ TermSeq(y) ∧ x In y), which in turn is equivalent to 

∃v∃w∃u∃z(∃y ≤ z)((x · x) = v ∧ v + x = w ∧ w + 1 = u ∧ z = 13^u ∧ TermSeq(y) 
∧ x In y) 

which is Σ and which, therefore, expresses a Σ₁ set. (Here we utilize the assumption 
that exponentiation is Σ.) The negation of this formula is equivalent to: 

(∀y ≤ π(x))(TermSeq(y) → ¬(x In y)), 

which in turn is equivalent to 

∃z(z = π(x) ∧ (∀y ≤ z)(TermSeq(y) → ¬(x In y))). 

Since TermSeq(y) is Σ₀, this formula can similarly be seen to be Σ. Therefore it 
expresses a Σ₁ set. 
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Similarly, we can show that the complement of the set of codes of formulas is 2i. Let 
NonForm(x) be a 2i-formula that expresses the complement of the set of codes of 
formulas. 

Next, for each axiom scheme and each axiom, we can write a 2-formula that expresses 
the complement of the set of codes of axioms that belong to type. For example, the 
complement of the set of codes of axioms of type LI (see Lesson 5), is expressible by 
means of the following 2-formula: 

(Vy < x)(Vw < x)(Vz < x) ((w imp y = z a y imp z = x) -*■ NonForm(x)) 

(This is 2 because the antecedent of the conditional is 2 .) Finally, we can construct a 2- 
formula that expresses the complement of the set of codes of axioms by conjoining these 
formulas that express the complements of the set of codes of axioms of a given type. 
Since the result is 2, we know, by Smullyan's theorem, that it is expressible by a 2i- 
formula as well. 

So since both the set of codes of axioms and the complement of that set are Σ₁, the set 
of codes of axioms is recursively decidable. 

Recall that by a theory I just mean a set of formulas. A theory that will be of use to us (in 
proving the undecidability of first-order logic) is the system R (after Raphael Robinson). 
R consists of: 

Q1: All sentences (m̄ + n̄) = k̄, where m + n = k. 

Q2: All sentences (m̄ · n̄) = k̄, where m × n = k. 

Q3: All sentences m̄ ≠ n̄, where m and n are distinct numbers. 

Q4: For each n, v* ≤ n̄ ↔ (v* = 0 ∨ v* = 0' ∨ ... ∨ v* = n̄). 

Q5: For each n, v* ≤ n̄ ∨ n̄ ≤ v*. 
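
Because each of these five clauses is a schema with one instance per number (or pair of numbers), it can help to see a few instances written out. The following is only a sketch: the function names, the string rendering of formulas, and the use of 0 followed by strokes as the numeral for n are assumptions made for illustration, not part of the official definition of R.

```python
def numeral(n: int) -> str:
    """Numeral for n: the symbol 0 followed by n successor strokes (assumed rendering)."""
    if n < 0:
        raise ValueError("numerals exist only for natural numbers")
    return "0" + "'" * n


def q1(m: int, n: int) -> str:
    """The Q1 instance recording the sum of m and n."""
    return f"({numeral(m)} + {numeral(n)}) = {numeral(m + n)}"


def q2(m: int, n: int) -> str:
    """The Q2 instance recording the product of m and n."""
    return f"({numeral(m)} · {numeral(n)}) = {numeral(m * n)}"


def q3(m: int, n: int) -> str:
    """The Q3 instance saying that the distinct numbers m and n differ."""
    if m == n:
        raise ValueError("Q3 only covers distinct numbers")
    return f"¬{numeral(m)} = {numeral(n)}"


def q4(n: int, var: str = "v*") -> str:
    """The Q4 instance for n: being at most n means being one of 0, ..., n."""
    disjuncts = " ∨ ".join(f"{var} = {numeral(i)}" for i in range(n + 1))
    return f"({var} ≤ {numeral(n)} ↔ ({disjuncts}))"


def q5(n: int, var: str = "v*") -> str:
    """The Q5 instance for n: every number is comparable with n."""
    return f"({var} ≤ {numeral(n)} ∨ {numeral(n)} ≤ {var})"


if __name__ == "__main__":
    for line in (q1(1, 2), q2(2, 2), q3(0, 1), q4(2), q5(1)):
        print(line)
```

Note that R has infinitely many axioms of each kind; the functions above merely produce any one of them on demand.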

Again, the "axioms" of a theory are understood to include not only the special axioms 
that are actually members of the theory but also the logical axioms of QL (not to be 
confused with Q). To say that a sentence P is provable in a theory A is to say that there 
exists a proof in which the last line is P, where a proof is a sequence of sentences in the 
pertinent language in which every line is either a member of A (an axiom) or can be 
derived from earlier lines by either Modus Ponens or Generalization. In the case of R, 
the axioms will be any of the axioms of QL as well as any of the axioms specified by 
Q1 through Q5. 

One more bit of terminology: To say that a formula is refutable in a theory is to say that 
there is a proof of the negation of that formula in the theory. If a theory is not 
syntactically complete, then there will be formulas that are neither provable nor refutable 
in the theory. 

So far, we have been proving our theorems by discovering facts about expressibility, viz., 
the expressibility of sets of numbers by formulas in the language of arithmetic. In order 
to introduce a different way of doing things, let me first formulate in other terms the 
concept of expressibility. What we have been saying is that F(v₁, ..., vₘ) expresses 
the relation R of m-tuples if and only if: 

F(n̄₁, ..., n̄ₘ) is true if and only if (n₁, ..., nₘ) ∈ R. 

The same concept could be expressed in different words as follows: Recall that 𝒩 is the 
set of truths of arithmetic. If a sentence is a truth of arithmetic, and therefore a member 
of 𝒩, then it is of course provable in 𝒩, and if a sentence in the language of arithmetic is 
false, so that its negation is true and therefore provable in 𝒩, then the sentence is 
refutable in 𝒩. (Of course, by Gödel's Theorem, we know that 𝒩 will not be correctly 
axiomatizable.) So we could just as well say that F(v₁, ..., vₘ) expresses the relation R of 
m-tuples if and only if the following two conditions hold: 

(1) If (n₁, ..., nₘ) ∈ R, then F(n̄₁, ..., n̄ₘ) is provable in 𝒩. 

(2) If (n₁, ..., nₘ) ∉ R, then F(n̄₁, ..., n̄ₘ) is refutable in 𝒩. 

Now we are going to generalize the concept of expressing a set by allowing that the 
pertinent provability and refutability might be provability and refutability in some theory 
weaker than all of 𝒩. In particular, we will be interested in what is provable and refutable 
in R. Also, in generalizing in this way, we will substitute the word "define" for the word 
"express". So we will speak of defining a set in a theory rather than expressing a set. So: 

A formula F(v) is said to define a set A in a theory Th if and only if for all numbers n, the 
following two conditions hold: 

(1) If n ∈ A, then F(n̄) is provable in Th. 

(2) If n ∉ A, then F(n̄) is refutable in Th. 

A formula F(v₁, ..., vₘ) is said to define an m-ary relation R in a theory Th if and only if 
for all numbers n₁, ..., nₘ: 

(1) If (n₁, ..., nₘ) ∈ R, then F(n̄₁, ..., n̄ₘ) is provable in Th. 

(2) If (n₁, ..., nₘ) ∉ R, then F(n̄₁, ..., n̄ₘ) is refutable in Th. 

In other words, if a formula defines a relation in a theory, then, returning to our earlier 
use of the term "express", we could say that the formula expresses the relation from the 
point of view of the theory. 

As for functions, a formula F(v₁, ..., vₘ, vₘ₊₁) strongly defines an m-ary function fun in a 
theory Th if and only if for all numbers n₁, ..., nₘ, nₘ₊₁: 

(1) if fun(n₁, ..., nₘ) = nₘ₊₁, then F(n̄₁, ..., n̄ₘ, n̄ₘ₊₁) is provable in Th, and 

(2) if fun(n₁, ..., nₘ) ≠ nₘ₊₁, then F(n̄₁, ..., n̄ₘ, n̄ₘ₊₁) is refutable in Th, and 

(3) if fun(n₁, ..., nₘ) = nₘ₊₁, then the sentence 

∀v(F(n̄₁, ..., n̄ₘ, v) → v = n̄ₘ₊₁) 

is provable in Th. 

(So "strong definability" pertains to functions only. Without condition (3), we could not 
say that, "according to Th", fun really is a function, with a unique output for each of its 
arguments.) 

A set (relation, function) is definable in a theory Th if and only if there exists a formula 
in the language of the theory that defines it. 

What we now want to work our way up to is the following claim: All recursive sets and 
relations are definable in R, and all recursive functions are strongly definable in R. (If 
necessary, review the definitions of recursive sets and relations, i.e., recursively definable 
sets and relations, and recursive functions in Lesson 6.) 

(As usual, terminology varies. The term "represent" may be used where here I say 
"define". Here I follow Smullyan.) 

Exercise (easier than it sounds): Prove that a set of numbers is arithmetic if and only if it 
is definable in 𝒩 (the set of truths of La). 

Outline of Proof: 

Definition: We say that a formula F(v₁, v₂) enumerates a set A in a theory Th if and only 
if for every number n, the following conditions hold: 

(1) If n ∈ A, then there is a number m such that F(n̄, m̄) is provable in Th. 

(2) If n ∉ A, then for every number m, F(n̄, m̄) is refutable in Th. 

Definition: We say that a formula F(v₁, v₂, ..., vₘ, vₘ₊₁) enumerates a relation R in a 
theory Th if and only if for all numbers n₁, ..., nₘ, the following conditions hold: 

(1) If (n₁, ..., nₘ) ∈ R, then there is a number n such that 
F(n̄₁, ..., n̄ₘ, n̄) is provable in Th. 

(2) If (n₁, ..., nₘ) ∉ R, then for every number n, 
F(n̄₁, ..., n̄ₘ, n̄) is refutable in Th. 

Definition: We say that a formula F(v₁, v₂, ..., vₘ) separates relation A from relation B 
in a theory Th if and only if for all numbers n₁, ..., nₘ: 

(1) If (n₁, ..., nₘ) ∈ A, then F(n̄₁, ..., n̄ₘ) is provable in Th. 

(2) If (n₁, ..., nₘ) ∈ B, then F(n̄₁, ..., n̄ₘ) is refutable in Th. 

(Similarly, for separation of sets.) 

Definition: A theory is said to be Σ₀-complete if and only if all true Σ₀-sentences are 
provable in it. 

We will utilize the following four theorems: 

The Σ₀-completeness of R: R is Σ₀-complete. 

The Enumeration Theorem: If Th is Σ₀-complete, then all Σ₁ relations are enumerable in 
Th. 

The Separation Theorem: If all axioms of types Q4 and Q5 (in R) are provable in Th, 
then if A and B are disjoint relations enumerable in Th, then A and B are separable in Th. 

The Definability Theorem: If any two disjoint Σ₁ relations are separable in Th, then all 
recursive relations are definable in Th. 

From these four theorems, we can immediately derive: 

Theorem 1: All recursive relations are definable in R. 

Proof: By the Definability Theorem, it suffices to show that any two disjoint Σ₁ 
relations are separable in R. Let A and B be two disjoint Σ₁ relations. Since R is 
Σ₀-complete, the Enumeration Theorem tells us that A and B are enumerable in R. So by 
the Separation Theorem, they are separable in R. 

Note: Throughout the following proofs, I will take for granted facts about provability 
that depend only on the logical axioms (from Lesson 5). 

R is Σ₀-complete (proof): 

(For a different presentation, see Smullyan, pp. 66-70.) 

Proposition 1: If w is any variable or numeral, (w ≤ n̄ ↔ (w = 0 ∨ w = 0' ∨ ... ∨ w = 
n̄)) is provable in R, by Q4, Generalization, and Universal Elimination (a derived rule; 
see example 3, Lesson 5). 

Proposition 2: All true atomic Σ₀-sentences are provable in R. 

Case (i): Sentences of the form m̄ = m̄ are provable in R because they are theorems 
of QL. 

Case (ii): True sentences of the form m̄ ≤ n̄ are provable in R: Suppose that m ≤ n. 
Since m̄ = m̄ is provable, so is (m̄ = 0 ∨ m̄ = 0' ∨ ... ∨ m̄ = m̄ ∨ ... ∨ m̄ = n̄). 
So by Proposition 1, m̄ ≤ n̄ is provable in R. 

Case (iii): By Q1 and Q2, true sentences of the form (m̄ + n̄) = k̄ and (m̄ · n̄) = k̄ are 
provable in R. 

Proposition 3: All false atomic Σ₀-sentences are refutable in R. 

Case (i): By Q3, false sentences of the form m̄ = n̄ are refutable in R. 

Case (ii): Suppose m̄ ≤ n̄ is false. So m̄ = 0, m̄ = 0', ..., m̄ = n̄ are all false. So 
by case (i), they are all refutable. So (m̄ = 0 ∨ m̄ = 0' ∨ ... ∨ m̄ = n̄) is refutable. 
So by Proposition 1, m̄ ≤ n̄ is refutable. 

Case (iii): Suppose (m̄ + n̄) = k̄ is false. Then for some number p ≠ k, m + n = p. 
By Proposition 2, (m̄ + n̄) = p̄ is provable in R, and, by Case (i), p̄ ≠ k̄ is provable 
in R. So, by first-order logic, (m̄ + n̄) ≠ k̄ is provable and (m̄ + n̄) = k̄ is refutable in 
R. 

Case (iv): Suppose (m̄ · n̄) = k̄ is false. Similar to Case (iii) . . . 

Proposition 4: Suppose F(w) is a Σ₀-formula having only w free. Suppose that F(0), 
F(0'), ..., F(n̄) are all provable in R. Then (∀w ≤ n̄)F(w) is provable in R. 

Proof: Assume the hypothesis. Then each of (w = 0 → F(w)), (w = 0' → F(w)), ..., 
(w = n̄ → F(w)) is provable in R. So ((w = 0 ∨ w = 0' ∨ ... ∨ w = n̄) → F(w)) is 
provable in R. But by Proposition 1, (w ≤ n̄ ↔ (w = 0 ∨ w = 0' ∨ ... ∨ w = n̄)) is 
provable in R. So (w ≤ n̄ → F(w)) is provable in R. So by Generalization, ∀w(w ≤ 
n̄ → F(w)) is provable in R. But this is (∀w ≤ n̄)F(w). 

Theorem: R is Σ₀-complete. 

We prove something stronger: Every true Σ₀-sentence is provable in R and every false 
Σ₀-sentence is refutable in R. 

Proof: By induction on the complexity of sentences. 

Basis: True atomic Σ₀-sentences are provable in R, by Proposition 2. False atomic 
Σ₀-sentences are refutable in R, by Proposition 3. 

Induction Hypothesis: Suppose that all Σ₀-sentences having complexity less than or 
equal to k are provable if true and refutable if false. 

Induction Step: 

Case ¬: If ¬P is true, then P is false, in which case, by IH, P is refutable, i.e., ¬P is 
provable. If ¬P is false, then P is true, in which case P is provable, and ¬¬P is 
provable, i.e., ¬P is refutable. 

Case →: Exercise. 

Case (∀v ≤ n̄) (bounded quantifiers): Recall, from the definition of Σ₀-sentences in 
Lesson 6, that the only remaining case is that of sentences of the form (∀v ≤ n̄)F(v). 

Case 1: (∀v ≤ n̄)F(v) is true. Then each of F(0), F(0'), ..., F(n̄) is true. So by 
the induction hypothesis, each of them is provable in R. So by Proposition 4, (∀v 
≤ n̄)F(v) is provable in R. 

Case 2: (∀v ≤ n̄)F(v) is false. Then for at least one m ≤ n, F(m̄) is false and, by 
IH, refutable in R. So by Proposition 2, case (ii), both m̄ ≤ n̄ and ¬F(m̄) are 
provable in R. So (∀v ≤ n̄)F(v) is refutable in R. 

End of proof 
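
The induction just given mirrors a finite checking procedure: the truth value of a Σ₀-sentence can be computed by recursion on its structure, since a bounded quantifier only ever ranges over finitely many numbers. The following is a minimal sketch of such a checker. The nested-tuple representation of formulas and the tag names ("=", "<=", "not", "imp", "all<=") are assumptions made for illustration; they are not the coding used in these notes.

```python
def term_value(term, env):
    """Value of a term: an int, a variable name, or a triple (op, left, right)."""
    if isinstance(term, int):
        return term
    if isinstance(term, str):
        return env[term]
    op, left, right = term
    a, b = term_value(left, env), term_value(right, env)
    if op == "+":
        return a + b
    if op == "*":
        return a * b
    raise ValueError(f"unknown term operator: {op!r}")


def holds(formula, env=None):
    """Truth value of a formula in the standard model, by recursion on structure."""
    env = {} if env is None else env
    tag = formula[0]
    if tag == "=":
        return term_value(formula[1], env) == term_value(formula[2], env)
    if tag == "<=":
        return term_value(formula[1], env) <= term_value(formula[2], env)
    if tag == "not":
        return not holds(formula[1], env)
    if tag == "imp":
        return (not holds(formula[1], env)) or holds(formula[2], env)
    if tag == "all<=":
        # Bounded universal quantifier: only the values 0, ..., bound are checked,
        # just as Proposition 4 only needs the instances F(0), ..., F(n).
        _, var, bound, body = formula
        limit = term_value(bound, env)
        return all(holds(body, {**env, var: k}) for k in range(limit + 1))
    raise ValueError(f"unknown formula tag: {tag!r}")


if __name__ == "__main__":
    # (∀v ≤ 3)(v ≤ 2 → v·v ≤ 4)
    guarded = ("all<=", "v", 3, ("imp", ("<=", "v", 2), ("<=", ("*", "v", "v"), 4)))
    # (∀v ≤ 3) v·v ≤ 4
    unguarded = ("all<=", "v", 3, ("<=", ("*", "v", "v"), 4))
    print(holds(guarded), holds(unguarded))
```

The checker computes truth, not provability; the content of the theorem is that R proves exactly the sentences this procedure accepts and refutes exactly those it rejects.
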
The Enumeration Theorem 

We will now prove that if Th is Σ₀-complete, then all Σ₁ relations are enumerable in Th. 






Proof: Suppose that Th is Σ₀-complete. Let R be any Σ₁ relation, and let R(x₁, x₂, 
..., xₘ) be the Σ₁-formula that expresses it. Then (by the definition of Σ₁-formulas), 
there is a Σ₀-formula S(x₁, x₂, ..., xₘ, y) such that: 

R(x₁, x₂, ..., xₘ) is true if and only if ∃yS(x₁, x₂, ..., xₘ, y) is true. 

We show that S(x₁, x₂, ..., xₘ, y) enumerates the relation R in Th. 

1. Suppose that (n₁, ..., nₘ) ∈ R. Then R(n̄₁, ..., n̄ₘ) is true, and for some number 
n, S(n̄₁, ..., n̄ₘ, n̄) is true. But this is a Σ₀-sentence; so it is provable in Th. 

2. Suppose (n₁, ..., nₘ) ∉ R. Then R(n̄₁, ..., n̄ₘ) is false, and for every n, S(n̄₁, ..., 
n̄ₘ, n̄) is false and ¬S(n̄₁, ..., n̄ₘ, n̄) is true. But the latter is a Σ₀-sentence, so it is 
provable in Th. So for every n, S(n̄₁, ..., n̄ₘ, n̄) is refutable in Th. 

End of proof. 
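
Informally, the enumerating formula S plays the role of a decidable witness test: a number belongs to the Σ₁ set just in case some witness passes the test. The sketch below illustrates this with a bounded search. The function names, the search limit, and the example (perfect squares, with the witness being the square root) are assumptions chosen for illustration only.

```python
from typing import Callable, Optional


def find_witness(s: Callable[[int, int], bool], n: int, limit: int) -> Optional[int]:
    """Search 0, ..., limit for a witness m with s(n, m); return None if none is found.

    Without the limit this would be a genuine semi-decision procedure:
    it halts exactly when n belongs to the set enumerated by s.
    """
    for m in range(limit + 1):
        if s(n, m):
            return m
    return None


def is_square_witness(n: int, m: int) -> bool:
    """Decidable test standing in for a Σ₀-formula S(n, m): m is a square root of n."""
    return m * m == n


if __name__ == "__main__":
    print(find_witness(is_square_witness, 9, 100))
    print(find_witness(is_square_witness, 10, 100))
```

For a member of the set, one instance of the test succeeds (clause 1 of the proof); for a non-member, every instance fails (clause 2).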



The Separation Theorem 

For simplicity, consider just the case of sets, as opposed to relations. Suppose that all 
axioms of types Q4 and Q5 are provable in Th. Suppose also that A and B are disjoint sets 
enumerable in Th. We prove that A and B are separable in Th, i.e., that there is a formula 
F(x) such that: 

(1) If n ∈ A, then F(n̄) is provable in Th. 

(2) If n ∈ B, then F(n̄) is refutable in Th. 

Let A(x, y) be the formula that enumerates A, and let B(x, y) be the formula that 
enumerates B. We take F(x) to be ∀y(B(x, y) → (∃z ≤ y)A(x, z)) and prove that this 
formula separates A and B. 
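
Read semantically, the separating formula says that every B-witness for x is preceded (or matched) by an A-witness for x. The following sketch evaluates that condition up to a finite bound, for decidable witness tests standing in for A(x, y) and B(x, y). It illustrates only the truth conditions of the formula, not its provability in Th; the function names, the bound, and the two example sets are assumptions for illustration.

```python
from typing import Callable

WitnessTest = Callable[[int, int], bool]


def separator_holds(n: int, a: WitnessTest, b: WitnessTest, bound: int) -> bool:
    """Check ∀y ≤ bound (B(n, y) → (∃z ≤ y) A(n, z)) by exhaustive search."""
    return all(
        (not b(n, y)) or any(a(n, z) for z in range(y + 1))
        for y in range(bound + 1)
    )


def a_witness(n: int, y: int) -> bool:
    """Enumerates A = the multiples of 3 (y witnesses that n = 3y)."""
    return n == 3 * y


def b_witness(n: int, y: int) -> bool:
    """Enumerates B = the numbers of the form 3y + 1, which is disjoint from A."""
    return n == 3 * y + 1


if __name__ == "__main__":
    for n in (6, 7, 8):
        print(n, separator_holds(n, a_witness, b_witness, 10))
```

For n = 6 (in A) the condition holds because n has no B-witness at all; for n = 7 (in B) it fails at the B-witness y = 2. For n = 8, which is in neither set, the condition happens to hold, which is harmless: separation makes no claim about numbers outside A and B.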

1. Suppose that n ∈ A. We need to show that ∀y(B(n̄, y) → (∃z ≤ y)A(n̄, z)) is 
provable in Th. 

Since A(x, y) enumerates A, there is some k such that A(n̄, k̄) is provable in Th. 

Since A and B are disjoint, n ∉ B. Since B(x, y) enumerates B, for every m ≤ k 
(indeed for every m), B(n̄, m̄) is refutable and ¬B(n̄, m̄) is provable. 

So each of (y = 0 → ¬B(n̄, y)), (y = 0' → ¬B(n̄, y)), ..., (y = k̄ → ¬B(n̄, y)) is 
provable in Th. 

So ((y = 0 ∨ y = 0' ∨ ... ∨ y = k̄) → ¬B(n̄, y)) is provable in Th. 

By Proposition 1 (which uses Q4), (y ≤ k̄ ↔ (y = 0 ∨ y = 0' ∨ ... ∨ y = k̄)) is 
provable in Th. 

So (y ≤ k̄ → ¬B(n̄, y)) is provable in Th. 
So (B(n̄, y) → ¬ y ≤ k̄) is provable in Th. 
So by Q5, (B(n̄, y) → k̄ ≤ y) is provable in Th. 

Since A(n̄, k̄) is provable too, (B(n̄, y) → (k̄ ≤ y ∧ A(n̄, k̄))) is provable in Th. 

So ∀y(B(n̄, y) → (∃z ≤ y)A(n̄, z)) is provable in Th. 

2. Suppose n ∈ B. We need to show that ∀y(B(n̄, y) → (∃z ≤ y)A(n̄, z)) is refutable 
in Th. 

Since B(x, y) enumerates B, there is some k such that B(n̄, k̄) is provable in Th. 

Since A and B are disjoint, n ∉ A. Since A(x, y) enumerates A, for every m ≤ k 
(indeed for every m), A(n̄, m̄) is refutable and ¬A(n̄, m̄) is provable. 

Reasoning as above (using Q4), (z ≤ k̄ → ¬A(n̄, z)) is provable in Th. So by 
Generalization, (∀z ≤ k̄)¬A(n̄, z) is provable in Th. 

So (B(n̄, k̄) ∧ (∀z ≤ k̄)¬A(n̄, z)) is provable in Th. 

So ¬(B(n̄, k̄) → ¬(∀z ≤ k̄)¬A(n̄, z)) is provable in Th. 

But this is ¬(B(n̄, k̄) → (∃z ≤ k̄)A(n̄, z)). So (B(n̄, k̄) → (∃z ≤ k̄)A(n̄, z)) is 
refutable in Th. 

So ∀y(B(n̄, y) → (∃z ≤ y)A(n̄, z)) is refutable in Th. 

The Definability Theorem 

Lastly, we have to observe that if any two disjoint Σ₁ relations are separable in Th, then 
all recursive relations are definable in Th. 

Suppose that any two disjoint Σ₁ relations are separable in Th, and let R be a recursive 
relation, with complement R̄. Since R is recursive, both R and R̄ are Σ₁. Obviously, R and 
R̄ are disjoint. So R and R̄ are separable in Th, i.e., there is a formula F(v₁, v₂, ..., vₘ) 
that separates relation R from relation R̄ in the theory Th, which means: 

(1) If (n₁, ..., nₘ) ∈ R, then F(n̄₁, ..., n̄ₘ) is provable in Th. 

(2) If (n₁, ..., nₘ) ∈ R̄, then F(n̄₁, ..., n̄ₘ) is refutable in Th. 

(2) can be rewritten thus: 

(2') If (n₁, ..., nₘ) ∉ R, then F(n̄₁, ..., n̄ₘ) is refutable in Th. 

So, by definition of definability, R is definable in Th. 

This completes the proofs of the four theorems that we needed to prove in order to prove 
that all recursive relations are definable in R. 

The definability of recursive functions 

There is one more task I wish to complete in this lesson, and that is to show that recursive 
functions are strongly definable in R. (Recall that that involves a third condition, (3), 
above.) 

Theorem 2: All recursive functions are strongly definable in R. 

Proof: For simplicity, we confine our attention to functions of one argument. 

Suppose that fun is a recursive function of one argument. Since all recursive relations are 
definable in R (Theorem 1), there is a formula F(x, y) that defines the relation fun(x) = y. 
This means that: 

(i) If fun(n) = m, then F(n̄, m̄) is provable in R. 

(ii) If fun(n) ≠ m, then F(n̄, m̄) is refutable in R. 

Let G(x, y) abbreviate the formula (F(x, y) ∧ ∀z(F(x, z) → y ≤ z)). We will show that 
G(x, y) strongly defines fun, i.e., that: 

(1) If fun(n) = m, then G(n̄, m̄) is provable in R. 

(2) If fun(n) ≠ k, then G(n̄, k̄) is refutable in R. 

(3) If fun(n) = m, then the sentence 

∀v(G(n̄, v) → v = m̄) 

is provable in R. 

Suppose fun(n) = m. 

(1) If k < m, then, by (ii), F(n̄, k̄) is refutable in R; so (F(n̄, k̄) → m̄ ≤ k̄) is provable 
in R, and (z = k̄ → (F(n̄, z) → m̄ ≤ z)) is provable. For instance, if 0 < m, then 
(F(n̄, 0) → m̄ ≤ 0) is provable, and (z = 0 → (F(n̄, z) → m̄ ≤ z)) is provable. 

If k = m, then m̄ and k̄ are the same numeral, so that m̄ = k̄ is provable and, by Q4, 
m̄ ≤ k̄ is provable; so (F(n̄, k̄) → m̄ ≤ k̄) is provable, and (z = m̄ → (F(n̄, z) → 
m̄ ≤ z)) is provable. 

So each of (z = 0 → (F(n̄, z) → m̄ ≤ z)), (z = 0' → (F(n̄, z) → m̄ ≤ z)), ..., 
(z = m̄ → (F(n̄, z) → m̄ ≤ z)) is provable. 

So ((z = 0 ∨ z = 0' ∨ ... ∨ z = m̄) → (F(n̄, z) → m̄ ≤ z)) is provable. 

By Proposition 1, (z ≤ m̄ ↔ (z = 0 ∨ z = 0' ∨ ... ∨ z = m̄)) is provable. 

So (z ≤ m̄ → (F(n̄, z) → m̄ ≤ z)) is provable. 

(m̄ ≤ z → (F(n̄, z) → m̄ ≤ z)) is provable, by propositional logic. 

So, by Q5, (F(n̄, z) → m̄ ≤ z) is provable, and by Generalization, ∀z(F(n̄, z) → 
m̄ ≤ z) is provable. 

By (i), F(n̄, m̄) is provable in R. 

So (F(n̄, m̄) ∧ ∀z(F(n̄, z) → m̄ ≤ z)), i.e., G(n̄, m̄), is provable in R. 

(2) Suppose k ≠ m. Then, by (ii), F(n̄, k̄) is refutable in R. 

So (F(n̄, k̄) ∧ ∀z(F(n̄, z) → k̄ ≤ z)), i.e., G(n̄, k̄), is refutable in R. 

(3) To show that ∀v(G(n̄, v) → v = m̄) is provable, we will show: 

(a) (G(n̄, v) → v ≤ m̄) is provable, and 

(b) (v ≤ m̄ → (G(n̄, v) → v = m̄)) is provable. 

From (a) and (b), we derive, by propositional logic, (G(n̄, v) → v = m̄), from 
which we obtain ∀v(G(n̄, v) → v = m̄) by Generalization. 

(a) Since G(n̄, v) is (F(n̄, v) ∧ ∀z(F(n̄, z) → v ≤ z)), 
(G(n̄, v) → ∀z(F(n̄, z) → v ≤ z)) is provable. 

So (G(n̄, v) → (F(n̄, m̄) → v ≤ m̄)) is provable. 

By (i), F(n̄, m̄) is provable. 

So (G(n̄, v) → v ≤ m̄) is provable. 

(b) If k < m, then by (2), G(n̄, k̄) is refutable; so (G(n̄, k̄) → k̄ = m̄) is provable. 
If k = m, then k̄ = m̄ is provable; so (G(n̄, k̄) → k̄ = m̄) is provable. 

So, by reasoning in the by-now-familiar way (see (1) in the proof of this 
theorem), (v ≤ m̄ → (G(n̄, v) → v = m̄)) is provable. 



End of proof 



Lesson 10: The Upper Diagonal Lemma and Some 
Consequences 



Our objective is to prove another diagonal lemma. This other diagonal lemma will be of 
use in proving that no consistent extension of R is decidable. That in turn will lead fairly 
directly to a strong form of Godel's First Incompleteness Theorem and to the 
Undecidability of First-order Logic. 

The strong definability of the diagonal function 

Proposition 1: For any n-ary function fun, if the relation fun(x₁, x₂, ..., xₙ) = xₙ₊₁ is Σ₁, 
then fun is recursive (by which I mean that fun is recursive in the sense of "recursive" 
that we defined for functions in Lesson 6). (Recall that "recursive" is short for 
"recursively decidable".) 

Proof: Suppose fun(x₁, x₂, ..., xₙ) = xₙ₊₁ is Σ₁. And suppose F(x₁, x₂, ..., xₙ, xₙ₊₁) is 
a Σ₁-formula that expresses it. Then the complement of this relation, viz., the relation 
fun(x₁, x₂, ..., xₙ) ≠ xₙ₊₁, is Σ₁ too, because it can be expressed with the following 
Σ-formula: 

∃y(F(x₁, x₂, ..., xₙ, y) ∧ y ≠ xₙ₊₁) 

Since both the relation and its complement are Σ₁, the relation is recursive. 

Say that Eₙ is the expression for which n is the Gödel number. So, recall, diag(n) = m if 
and only if m is the code of ∀v(v = n̄ → Eₙ). 

Recall that in Lesson 7, we found that diag is arithmetic. In fact, inspection of the 
formula that expresses it shows that it is Σ₁. (We cannot find a Σ₀-formula that expresses 
it, because we do not have a Σ₀-formula that expresses exponentiation.) 

Refer back to Lesson 9 for the definition of strongly definable function. 

Theorem: The diagonal function diag is strongly definable in R. 

Proof: As we have seen, diag is Σ₁. So by the above proposition, it is recursive. So 
by Theorem 2 of Lesson 9, it is strongly definable in R. 

The upper diagonal lemma 

Before proving our new diagonal lemma, we must prove the following proposition: 

Proposition 2: If a one-place function fun is strongly definable in a theory Th, then for 
any formula F(v) (in the language of the theory) there is a formula H(v) such that for any 
number n, if fun(n) = m, then the following sentence is provable in Th: 

(H(n̄) ↔ F(m̄)) 

Proof: Suppose K(v, w) strongly defines fun in Th, and let H(v) be the formula 
∃w(K(v, w) ∧ F(w)). We show that if fun(n) = m, then 

(∃w(K(n̄, w) ∧ F(w)) ↔ F(m̄)) 

is provable in Th. 

Suppose fun(n) = m. 

1. Since K(v, w) strongly defines fun, K(n̄, m̄) is provable in Th. So 
(F(m̄) → (K(n̄, m̄) ∧ F(m̄))) is provable; so (F(m̄) → ∃w(K(n̄, w) ∧ F(w))) is 
provable. 

2. Since K(v, w) strongly defines fun, ∀v(K(n̄, v) → v = m̄) is provable in Th. 
Therefore, ∀v((K(n̄, v) ∧ F(v)) → (v = m̄ ∧ F(v))) is provable; so ∀v((K(n̄, v) ∧ 
F(v)) → F(m̄)) is provable; so (∃w(K(n̄, w) ∧ F(w)) → F(m̄)) is provable. 

End of proof 

Definition: Where X is an expression (of the language of arithmetic), let X̄ be the 
numeral denoting #(X), the Gödel number of X. 

Definition: A sentence G is a fixed point of a formula F(v) in a theory Th if and only if 
the sentence (G ↔ F(Ḡ)) is provable in Th. 

Definition: We say that one theory Th1 is an extension of another theory Th2 if and only 
if all of the theorems of Th2 are also theorems of Th1, i.e., Con(Th2) ⊆ Con(Th1). 
(Unless otherwise noted, we assume that an extension of a theory is in the same language 
as the theory.) 

The Upper Diagonal Lemma: Every formula F(v) of La has a fixed point in any 
consistent extension Th of R. 

Proof: Let F(v) be a formula in La, and let Th be a consistent extension of R. Since 
diag is strongly definable in R, and hence in Th, we know by Proposition 2 that there 
is a formula H(v) such that for any number n, if diag(n) = m, then the following 
sentence is provable in Th: 

(H(n̄) ↔ F(m̄)) 

Let h be the code of H(v), and suppose diag(h) = k. So: 

(H(h̄) ↔ F(k̄)) 

is provable in Th. So 

(∀v(v = h̄ → H(v)) ↔ F(k̄)) 

is provable in Th. But k is the code of ∀v(v = h̄ → H(v)). So ∀v(v = h̄ → H(v)) is 
a fixed point for F(v). 
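
The key bookkeeping step in this proof, that the code of ∀v(v = h̄ → H(v)) is exactly diag(h), can be checked mechanically on a toy coding. The sketch below is not the Gödel numbering of these notes: the coding via UTF-8 bytes, the use of decimal digits in place of the numeral n̄, and the function names are all assumptions made to keep the example small.

```python
def code(expression: str) -> int:
    """Toy Gödel number: the UTF-8 bytes of the expression read as one big integer."""
    return int.from_bytes(expression.encode("utf-8"), "big")


def decode(n: int) -> str:
    """Inverse of code for numbers that are codes of non-empty expressions."""
    return n.to_bytes((n.bit_length() + 7) // 8, "big").decode("utf-8")


def diag(n: int) -> int:
    """Code of ∀v(v = n → E), where E is the expression whose code is n.

    The numeral for n is written in decimal here (an assumption, for brevity).
    """
    return code(f"∀v(v = {n} → {decode(n)})")


def fixed_point_sentence(h_formula: str) -> str:
    """The sentence ∀v(v = h → H(v)), where h is the code of the formula H(v)."""
    return f"∀v(v = {code(h_formula)} → {h_formula})"


if __name__ == "__main__":
    H = "∃w(K(v, w) ∧ F(w))"
    G = fixed_point_sentence(H)
    print(code(G) == diag(code(H)))
```

The check confirms only the syntactic fact that G has code diag(h). That G is provably equivalent to F(Ḡ) depends on the strong definability of diag, which this sketch does not model.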



Compare this result to the Lower Diagonal Lemma in Lesson 7. Here, rather than 
showing merely that diag is arithmetic, we show that, since diag is Σ₁, it is strongly 
definable in R. And instead of proving that the Gödel sentence for a set is true if and 
only if its code belongs to the set, we prove that the fixed point (a Gödel sentence) is 
provably (in Th) materially equivalent to the sentence that says that its code satisfies 
F(v). 

The Upper Diagonal Lemma is not a consequence of the Lower Diagonal Lemma, since 
the Lower Diagonal Lemma does not deal with arbitrary consistent extensions of R. 

But the Lower Diagonal Lemma is a consequence of the Upper Diagonal Lemma: 
Suppose A is arithmetic and F(v) expresses A. Since 𝒩 (the set of truths of arithmetic) is 
an extension of R, by the Upper Diagonal Lemma, there is a sentence G such that 
(G ↔ F(Ḡ)) is provable in 𝒩 and, hence, true, so that G is true if and only if F(Ḡ) is 
true. But F(Ḡ) is true if and only if #(G) ∈ A. So G is true if and only if #(G) ∈ A. 






The undecidability of consistent extensions of R. 

Recall that if F is any formula, then #(F) is the Godel number (code) of F. 

Lemma: If Th is a consistent extension of R, then the set of codes of formulas provable 
in Th is not definable in Th. 

Proof: Suppose Th is a consistent extension of R. Let P be the set of codes of 
formulas provable in Th, and suppose, for a reductio, that P is definable in Th. 
Suppose that H(v) defines P in Th. So: 

If n ∈ P, then H(n̄) is provable in Th. 

If n ∉ P, then H(n̄) is refutable in Th, i.e., ¬H(n̄) is provable in Th. 

By the Upper Diagonal Lemma, there is a fixed point for ¬H(v), i.e., a sentence G 
such that 

(G ↔ ¬H(Ḡ)) 

is provable in Th. We show both that G is provable and not provable (in Th). 

G is not provable: Suppose, for a reductio, that G is provable. Then #(G) ∈ P. So 
H(Ḡ) is provable. So by the provability of the above biconditional, ¬G is provable. 
So, since Th is consistent, G is not provable. 

G is provable: Suppose, for a reductio, that G is not provable. Then #(G) ∉ P. So 
¬H(Ḡ) is provable. So by the provability of the above biconditional, G is provable. 

Contradiction! 

Theorem (the undecidability of consistent extensions of R): If Th is a consistent extension 
of R, then the set of codes of theorems of Th is not recursive (so that, by Church's thesis, 
the set of theorems of Th is not decidable). 

Proof: Suppose that Th is a consistent extension of R. By the above Lemma, the set 
of codes of theorems of Th is not definable in Th. But Theorem 1 of Lesson 9 states 
that all recursive relations are definable in R. So all recursive relations are definable 
in any extension of R. So the set of codes of theorems of Th is not recursive. 

Note: We call this result the "undecidability of consistent extensions of R", although 
strictly speaking what is undecidable is not the set Th but the set Con(Th). In general, we 
will say that a theory is undecidable when what we mean is that the set of theorems of the 
theory is undecidable. (As I said at the beginning of Lesson 8, some people reserve the 
term "theory" for sets of sentences closed under first-order consequence.) Later, we will 
say that first-order logic is undecidable when what we mean is that the set of theorems of 
QL is undecidable. 

Advisement: Nothing in subsequent lessons will depend on any of the rest of this lesson. 
The rest of this lesson is here for its intrinsic interest. 

Tarski Undefinability Revisited: 

I pause now to prove a second version of the Tarski Undefinability Theorem. We will 
have no further use for this, but it is the form of the theorem that one usually encounters 
in the literature (e.g., the literature on the liar paradox). 

Definition: Say that T(v) is a truth-predicate for a theory Th if and only if for every 
sentence S, the sentence (S ↔ T(S̄)) is provable in Th. 

Tarski's Undefinability Theorem (upper version): If Th is any consistent extension of R, 
then there is no truth-predicate for Th. 

Proof: Suppose, for a reductio, that T(v) is a truth-predicate for Th, a formally 
consistent extension of R. By the Upper Diagonal Lemma, ¬T(v) has a fixed point, 
call it G. So (G ↔ ¬T(Ḡ)) is provable in Th. But since T(v) is a truth predicate, (G 
↔ T(Ḡ)) is provable in Th. So (T(Ḡ) ↔ ¬T(Ḡ)) is provable in Th. So Th is 
formally inconsistent, contrary to assumption. 

Corollary: There is no truth predicate for 𝒩, the set of truths of arithmetic. 

Proof: 𝒩 is a consistent extension of R. 

The decidability of effectively enumerable, complete theories: If Th is complete and the 
set of codes of theorems of Th is recursively enumerable, then the set of codes of 
theorems of Th is recursive. 

Recall that by a complete theory, we mean a theory such that for every sentence P (in the 
language of the theory), either P or ¬P is a theorem. 

Nota bene: We do not say that in a complete theory every formula or its negation is a 
theorem. However, if a theory is complete, then for every formula F, if ∀v₁ ... ∀vₙF is a 
sentence, then either ∀v₁ ... ∀vₙF or ¬∀v₁ ... ∀vₙF is a theorem. 

We show that if a theory is complete, and the set of codes of theorems is recursively 
enumerable (i.e., Σ₁), then the set of codes of theorems is recursive (i.e., recursively 
decidable). By Church's thesis, it follows that if a theory is complete and its theorems 
are effectively enumerable, then the set of theorems is decidable. 

Suppose that Th is complete and P, the set of codes of theorems of Th, is Σ₁. To obtain 
our result, all we have to do is show that P̄, the complement of P, is Σ₁. (P̄ includes 
numbers that are codes of expressions that are not even formulas, as well as codes of 
formulas that are not theorems.) 

Case 1: Th is inconsistent. Then every formula is a theorem, so P̄ is just the set of 
codes that are not codes of formulas, which is Σ₁ (see the end of Lesson 8). 
Case 2: Th is consistent. 

Say that a formula P is the opposite of a formula Q if and only if either Q = ¬P or P 
= ¬Q. Say that a sentence P is a universal closure of a formula Q if and only if: if 
v₁, ..., vₙ are the variables in Q, then P = ∀v₁ ... ∀vₙQ. (For simplicity, we do not 
assume that all of v₁, ..., vₙ are free in Q. So some of the quantifiers in the universal 
closure may be vacuous.) I take for granted that the following relations are Σ₁: 

x is a code of an opposite of the formula of which y is the code 

x is a code of a universal closure of the formula of which y is the code 

Let us abbreviate the Σ₁-formulas that express these relations as Opp(x, y) and 
UC(x, y), respectively. 

Recall from the end of Lesson 8 that the set of codes that are not codes of well-formed 
formulas is Σ₁. Let NonForm(x) abbreviate the Σ₁-formula that expresses 
that set. 

Since the set of codes of theorems of Th is recursively enumerable, there is a Σ₁-formula 
Prov(v) that expresses it. Since Th is consistent and complete, the following 
is a Σ-formula that expresses the set of codes of nonformulas and unprovable 
formulas, P̄. 

(NonForm(x) ∨ ∃y(Prov(y) ∧ (Opp(x, y) ∨ ∃z(UC(z, x) ∧ Opp(y, z))))) 

For example, if F(v) is not provable, then either its opposite is provable or ¬∀vF(v) 
is provable. 

Reasoning informally, the truth of this theorem should be clear. If Th is a complete 
theory and its theorems are enumerable, then it is decidable whether any formula is a 
theorem of Th, because for every formula, either it or its negation or the negation of its 
universal closure is bound to show up in the enumeration (and if the negation of the 
universal closure of F shows up, and the theory is consistent, then we know that F is not 
going to show up). 
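
The informal argument can be turned into a small search procedure. The sketch below handles only sentences (so the complication about universal closures does not arise) and uses a deliberately trivial "theory": the true sentences among a=b and ¬a=b for natural numbers a and b. The toy language, the enumeration order, and the function names are assumptions for illustration; the procedure is only guaranteed to halt on sentences of that toy language.

```python
from itertools import count
from typing import Callable, Iterator


def opposite(sentence: str) -> str:
    """The opposite of a sentence: strip a leading ¬ if there is one, else add one."""
    return sentence[1:] if sentence.startswith("¬") else "¬" + sentence


def toy_theorems() -> Iterator[str]:
    """Enumerate a toy complete, consistent theory: every true a=b or ¬a=b."""
    for total in count(0):
        for a in range(total + 1):
            b = total - a
            yield f"{a}={b}" if a == b else f"¬{a}={b}"


def is_theorem(sentence: str, theorems: Callable[[], Iterator[str]]) -> bool:
    """Decide theoremhood by waiting for the sentence or its opposite to appear.

    Completeness guarantees that one of the two shows up; consistency
    guarantees that they do not both show up.
    """
    target_opposite = opposite(sentence)
    for theorem in theorems():
        if theorem == sentence:
            return True
        if theorem == target_opposite:
            return False
    raise RuntimeError("enumeration ended; the theory was not complete")


if __name__ == "__main__":
    for s in ("3=3", "2=5", "¬2=5", "¬4=4"):
        print(s, is_theorem(s, toy_theorems))
```

If the theory were incomplete, the loop could run forever on a sentence that is neither provable nor refutable, which is exactly why completeness is needed in the theorem.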

In Lesson 8, we defined the concept of a correctly axiomatizable theory. We are not 
concerned with correctness (truth) any more. So we will say that a theory is 
axiomatizable if and only if there is a decidable set of formulas A such that all of the 
members of the theory are consequences of A. 

Gödel's First Incompleteness Theorem (fourth formulation): No consistent, complete 
extension of R is axiomatizable. 

Proof: Suppose, for a reductio, that Th is a consistent, complete, axiomatizable 
extension of R. By analogy with the arithmetization of proof in P.A. in Lesson 8, the 
set of codes of theorems of Th is Σ₁. By the decidability of effectively enumerable, 
complete theories, the set of codes of theorems of Th is recursive. By the undecidability 
of consistent extensions of R, the set of codes of theorems of Th is not recursive. 
Contradiction! 

This of course entails that 𝒩 is not axiomatizable (and thus not correctly axiomatizable, 
our third formulation, in Lesson 8), because 𝒩 is a consistent, complete extension of R. 

The virtue of this fourth formulation and its proof, in comparison with our earlier proofs, 
is that we did not have to assume that R is true, only that it is consistent. And R is a very 
weak theory. (Just look at it!) 



Lesson 11: The Undecidability of First-order Logic 



Another theory that will be of use to us is the theory called simply Q. Q is just the first 
nine axioms of P.A. (so, P.A. minus the induction scheme), which I repeat here for 
convenience: 

N1: $(x' = y' \to x = y)$

N2: $\neg\, 0 = x'$

N3: $(x + 0) = x$

N4: $(x + y') = (x + y)'$

N5: $(x \cdot 0) = 0$

N6: $(x \cdot y') = ((x \cdot y) + x)$

N7: $(x \le 0 \leftrightarrow x = 0)$

N8: $(x \le y' \leftrightarrow (x \le y \lor x = y'))$

N9: $(x \le y \lor y \le x)$



Let A be the universal closure of the conjunction of these nine formulas. And suppose, 
for a reductio, that first-order logic is decidable (i.e., it is decidable whether any given 
formula is a theorem of QL, which means, by soundness and completeness, that it is also 
decidable whether any given formula is first-order valid). Then, assuming that the 
language of QL contains $\mathcal{L}_A$, for any formula S of $\mathcal{L}_A$, it is decidable whether $(A \to S)$ is 
a theorem of first-order logic. But $(A \to S)$ is a theorem of first-order logic if and only if 
S is a theorem of Q. So Q is decidable. But we already know that any consistent 
extension of R is undecidable. So suppose we can show that Q is a consistent extension 
of R. Then Q is undecidable. Contradiction! 

So if we can show that Q is a consistent extension of R, it will follow that first-order 
logic is not decidable. So our first order of business is to show that Q is a consistent 
extension of R. We will simply assume that Q is consistent; but we need to show that Q is 
an extension of R. 

For convenience, I also repeat the specification of the axioms of R: 

$\Omega_1$: All sentences $(\overline{m} + \overline{n}) = \overline{k}$, where m + n = k. 

$\Omega_2$: All sentences $(\overline{m} \cdot \overline{n}) = \overline{k}$, where m × n = k. 

$\Omega_3$: All sentences $\overline{m} \neq \overline{n}$, where m and n are distinct numbers. 

$\Omega_4$: For each n, $v^{*} \le \overline{n} \leftrightarrow (v^{*} = 0 \lor v^{*} = 0' \lor \dots \lor v^{*} = \overline{n})$. 

$\Omega_5$: For each n, $v^{*} \le \overline{n} \lor \overline{n} \le v^{*}$. 

Theorem: Q is an extension of R. That is, $\mathrm{Con}(R) \subseteq \mathrm{Con}(Q)$. 

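Proof: We need to prove that for each of the five kinds of axioms of R, each axiom of 
that kind is a theorem of Q. That will ensure that every theorem of R is also a 
theorem of Q. 

$\Omega_1$: By N3, for each m, we have $(\overline{m} + 0) = \overline{m}$. By N4, we have $(\overline{m} + 0') = (\overline{m} + 0)'$. 
So, by substitution of identicals, we have $(\overline{m} + 0') = \overline{m}'$. In other words, 
$(\overline{m} + \overline{1}) = \overline{m+1}$. By N4, $(\overline{m} + 0'') = (\overline{m} + 0')'$. By substitution of identicals, 
$(\overline{m} + 0'') = \overline{m+1}'$, i.e., $(\overline{m} + \overline{2}) = \overline{m+2}$. Similarly, $(\overline{m} + \overline{3}) = \overline{m+3}$, ..., 
$(\overline{m} + \overline{n}) = \overline{m+n}$, ... This gives us every axiom of type $\Omega_1$. 

$\Omega_2$: By N5, we have, for each m, $(\overline{m} \cdot 0) = 0$. By N6, we have $(\overline{m} \cdot \overline{1}) = ((\overline{m} \cdot 0) + \overline{m})$. 
So, by substitution of identicals, we have $(\overline{m} \cdot \overline{1}) = (0 + \overline{m})$. 
By the previous paragraph, we have $(0 + \overline{m}) = \overline{0+m}$. So we have $(\overline{m} \cdot \overline{1}) = \overline{0+m}$. 
So we have $(\overline{m} \cdot \overline{1}) = \overline{m \times 1}$. By N6 again, we have $(\overline{m} \cdot \overline{1}') = ((\overline{m} \cdot \overline{1}) + \overline{m})$. 
So, by substitution of identicals, we have $(\overline{m} \cdot \overline{1}') = (\overline{m \times 1} + \overline{m})$, 
i.e., $(\overline{m} \cdot \overline{2}) = (\overline{m} + \overline{m})$. By the previous paragraph, we have $(\overline{m} + \overline{m}) = \overline{m \times 2}$. 
So we have $(\overline{m} \cdot \overline{2}) = \overline{m \times 2}$. And so on, for each n, we have 
$(\overline{m} \cdot \overline{n}) = \overline{m \times n}$. This gives us every axiom of type $\Omega_2$. 

$\Omega_3$: By N1, we have, for every m and n, $(\overline{m+1} = \overline{n+1} \to \overline{m} = \overline{n})$. By N2, for 
any positive n, $0 \neq \overline{n}$. So $\overline{0+1} \neq \overline{n+1}$, i.e., $\overline{1} \neq \overline{n+1}$. So $\overline{1+1} \neq \overline{n+2}$, i.e., 
$\overline{2} \neq \overline{n+2}$, and so on. This gives us every axiom of type $\Omega_3$. 

$\Omega_4$: By N7 we have $(v^{*} \le 0 \leftrightarrow v^{*} = 0)$. Suppose, as an induction hypothesis, that 
$(v^{*} \le \overline{n} \leftrightarrow (v^{*} = 0 \lor \dots \lor v^{*} = \overline{n}))$. By N8, we have 
$(v^{*} \le \overline{n+1} \leftrightarrow (v^{*} \le \overline{n} \lor v^{*} = \overline{n+1}))$. From these last two formulas, we derive: 
$v^{*} \le \overline{n+1} \leftrightarrow (v^{*} = 0 \lor v^{*} = 0' \lor \dots \lor v^{*} = \overline{n+1})$. 

$\Omega_5$: These are all consequences of N9. 
End of proof 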
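
The first two cases of the proof amount to a recursive computation that uses N3 through N6 as rewrite rules. Here is a minimal sketch of that idea, assuming a numeral is written as the symbol 0 followed by n accents; the function names `numeral`, `add` and `mul` are illustrative and not part of the notes.

```python
def numeral(n: int) -> str:
    """The numeral for n: the symbol 0 followed by n accents."""
    return "0" + "'" * n


def add(s: str, t: str) -> str:
    """Reduce (s + t) to a numeral using only N3 and N4."""
    if t == "0":                     # N3: (x + 0) = x
        return s
    return add(s, t[:-1]) + "'"      # N4: (x + y') = (x + y)'


def mul(s: str, t: str) -> str:
    """Reduce (s . t) to a numeral using only N5 and N6."""
    if t == "0":                     # N5: (x . 0) = 0
        return "0"
    return add(mul(s, t[:-1]), s)    # N6: (x . y') = ((x . y) + x)


print(add(numeral(2), numeral(3)) == numeral(5))   # True
print(mul(numeral(2), numeral(3)) == numeral(6))   # True
```

Each recursive call corresponds to one application of an axiom followed by substitution of identicals, so the depth of the recursion tracks the length of the derivation in Q.
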
The undecidability of first-order logic 

Theorem (the undecidability of first-order logic): The set of codes of formulas of $\mathcal{L}_A$ that 
are first-order valid is not recursive. (And so, by Church's thesis, the set of valid 
formulas of the language of arithmetic is not decidable.) 

Proof: (For the basic idea, see above. We now give a precise proof.) 

Let $V$ be the set of codes of first-order valid formulas of $\mathcal{L}_A$, and let $\tilde{V}$ be the 
complement of that set (the union of the set of codes of nonformulas and the set of codes 
of nonvalid formulas). Suppose, for a reductio, that $V$ is recursive. So there is a 
$\Sigma_1$-formula Val(x) that expresses $V$ and a $\Sigma_1$-formula NonVal(x) that expresses $\tilde{V}$. 

Let A be the universal closure of the conjunction of the axioms of Q (i.e., the result of 
adding $\forall v^{*}\forall v^{**}$ to that conjunction), and let a be the code for A. 

Then we can define a $\Sigma$-formula that expresses the set of codes of formulas S such 
that $(A \to S)$ is first-order valid. Here it is: 

$\exists y(\mathrm{Val}(y) \land \mathrm{Concat}_5(2, a, 8, x, 3, y))$. 

And we can define a $\Sigma$-formula that expresses the set of codes of expressions S such 
that $(A \to S)$ is not first-order valid (either not a formula at all, or not a valid 
formula). Here it is: 

$\exists y(\mathrm{NonVal}(y) \land \mathrm{Concat}_5(2, a, 8, x, 3, y))$. 

But these two formulas express the set of codes of theorems of Q and the complement 
of that set, respectively. So by Smullyan's Theorem, we can find $\Sigma_1$-formulas that 
express the same sets. So the set of codes of theorems of Q is recursive. 

But by the above theorem, Q is a consistent extension of R (since we're assuming that 
Q is consistent). So by the main result of Lesson 10, the set of theorems of Q is not 
recursive. Contradiction! 

End of proof 
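
The shape of the reduction (a decider for validity yields a decider for theoremhood in a finitely axiomatized theory) can be illustrated in a setting where a validity decider really exists. The sketch below is an assumption-laden stand-in: it uses propositional logic with truth tables, formulas written as Python boolean expressions over single-letter variables, and hypothetical names `is_valid`, `conditional` and `is_theorem`. In the first-order case no such `is_valid` can exist, which is the point of the theorem.

```python
import re
from itertools import product


def is_valid(formula: str) -> bool:
    """Truth-table test: is the formula true under every assignment?"""
    names = sorted(set(re.findall(r"\b[a-z]\b", formula)))
    return all(
        eval(formula, {"__builtins__": {}}, dict(zip(names, values)))
        for values in product([False, True], repeat=len(names))
    )


def conditional(antecedent: str, consequent: str) -> str:
    """Build the material conditional (antecedent -> consequent)."""
    return f"((not ({antecedent})) or ({consequent}))"


def is_theorem(sentence: str, axioms: str, validity_decider=is_valid) -> bool:
    """S follows from the axioms iff (A -> S) is valid."""
    return validity_decider(conditional(axioms, sentence))


A = "p and ((not p) or q)"      # toy axioms: p, and p -> q
print(is_theorem("q", A))       # True
print(is_theorem("r", A))       # False
```

The function `is_theorem` does no work of its own: all of the deciding is delegated to `validity_decider`, just as deciding theoremhood in Q is delegated to Val(x) in the proof.
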
First note: We have not shown that every first-order language is undecidable, i.e., that 
the set of valid sentences of any first-order language is undecidable. We have only 
shown that the set of first-order valid sentences of $\mathcal{L}_A$ is undecidable. And in fact, the 
set of valid formulas of a first-order language containing exclusively monadic predicates is 
decidable. (That fact requires a proof that we will not go through.) But consider a 
language L that is similar to $\mathcal{L}_A$ to the extent that it contains at least two dyadic predicates 
(like "=" and "$\le$"), and at least one one-place function symbol (like " ' ") and at least two 
two-place function symbols (like "+", "•"). It is clear that if we had an algorithm for 
deciding whether a formula in L was first-order valid, then we would have an algorithm 
for deciding whether a formula in $\mathcal{L}_A$ was valid, which, as we have just seen, we do not 
have. So we may conclude that first-order validity is not decidable even in this broader 
class of languages. Likewise, the set of first-order valid sentences of any language with a 
more extensive vocabulary will be undecidable, for if it were decidable, then the first-order 
valid sentences of this "sublanguage" would be decidable. 

Second note: The set of valid arguments is not decidable, for if it were then the set of 
valid arguments with finitely many premises would be decidable, and if that were 
decidable, then the set of first-order valid conditionals would be decidable, and if that 
were decidable, we could likewise show that the set of theorems of Q was decidable. 

Third note: Although first-order logic is undecidable (i.e., again, the set of first-order 
valid formulas is undecidable), for any invalid argument we can demonstrate that it is 
invalid. We do so by describing a structure in which the premises are true and the 
conclusion is not true. First-order logic is undecidable because we have no algorithm for 
finding such counterexamples. 

Thought exercise: The theorems of first-order logic (which, by soundness and 
completeness, are the first-order valid formulas) are effectively enumerable (by the 
arithmetization of proof). So first-order logic would be decidable if we could effectively 
enumerate the set comprising nonformulas and formulas that are not valid. We cannot do 
this, but why not? If you were to try to find a $\Sigma_1$-formula that expresses the codes of 
nonprovable formulas, where would you encounter a problem? 



Lesson 12: Gödel's Second Incompleteness Theorem 



What Gödel's Second Incompleteness Theorem says is that Peano Arithmetic (P.A.) 
cannot prove its own consistency. This result can be generalized by paying attention to 
which properties of P.A. we actually use in the proof. Then we can say of every theory 
that has those properties that it cannot prove its own consistency. But for simplicity, I 
will confine attention to P.A. I am ripping this presentation pretty much straight out of 
Smullyan, pp. 106-109, but I am adding some material that I took from George Boolos, 
The Logic of Provability, Cambridge University Press, 1994. 

Provability predicates 

Recall that if X is an expression, then $\overline{X}$ is the numeral denoting the code of that 
expression, #(X). 

Definition: A formula P(x) is a provability predicate for a theory Th if and only if the 
following three conditions hold: 

P1: If X is provable in Th, then $P(\overline{X})$ is provable in Th. 

P2: $(P(\overline{X \to Y}) \to (P(\overline{X}) \to P(\overline{Y})))$ is provable in Th. 

P3: $(P(\overline{X}) \to P(\overline{P(\overline{X})}))$ is provable in Th. 

Theorem: If P(x) is a provability predicate for Th, then the following three facts hold: 

P4: If $(X \to Y)$ is provable in Th, then $(P(\overline{X}) \to P(\overline{Y}))$ is provable in Th. 

P5: If $(X \to (Y \to Z))$ is provable in Th, then $(P(\overline{X}) \to (P(\overline{Y}) \to P(\overline{Z})))$ is provable 
in Th. 

P6: If $(X \to (P(\overline{X}) \to Y))$ is provable in Th, then $(P(\overline{X}) \to P(\overline{Y}))$ is provable in Th. 

Proof: 

P4: Suppose $(X \to Y)$ is provable. 
By P1, $P(\overline{X \to Y})$ is provable. 
By P2 and Modus Ponens, $(P(\overline{X}) \to P(\overline{Y}))$ is provable. 

P5: Suppose $(X \to (Y \to Z))$ is provable. 
By P4, $(P(\overline{X}) \to P(\overline{Y \to Z}))$ is provable. 
By P2, $(P(\overline{Y \to Z}) \to (P(\overline{Y}) \to P(\overline{Z})))$ is provable. 
So, by propositional logic, $(P(\overline{X}) \to (P(\overline{Y}) \to P(\overline{Z})))$ is provable. 

P6: Suppose $(X \to (P(\overline{X}) \to Y))$ is provable. 
By P5, $(P(\overline{X}) \to (P(\overline{P(\overline{X})}) \to P(\overline{Y})))$ is provable. 
By P3, $(P(\overline{X}) \to P(\overline{P(\overline{X})}))$ is provable. 
So, by propositional logic, $(P(\overline{X}) \to P(\overline{Y}))$ is provable. 

Theorem: The $\Sigma_1$-formula Prov(x) that expresses the set of codes of formulas provable 
in P.A. is a provability predicate for P.A. 

Note: Is there a $\Sigma_1$-formula that expresses the set of codes of formulas provable in P.A.? 
Yes, in Lesson 8 we showed that there is a $\Sigma$-formula that expresses the codes of 
formulas provable in P.A., which means, by Smullyan's theorem, that there is a 
$\Sigma_1$-formula that expresses it. 

Proof: We will prove only that condition P1 holds and will sketch a proof that P2 holds. 
For a sketch of a proof that P3 holds (in the context of a different system of Gödel 
numbering), see George Boolos, The Logic of Provability, op. cit. For a detailed 
proof, one must apparently go to David Hilbert and Paul Bernays, Grundlagen der 
Mathematik, vol. 1, 1934, vol. 2, 1939 (although I have never tried that myself). 

Since Prov(x) is $\Sigma_1$, there is some $\Sigma_0$-formula Pf(x, y) such that Prov(x) is 
$\exists y\,\mathrm{Pf}(x, y)$. (Formerly, we said that Prov(x) was $\exists y(\mathrm{Pf}(y) \land x \mathrel{\mathrm{In}} y)$, but now I will abbreviate 
the latter.) Since P.A. extends Q (by the addition of the induction axioms) and Q 
extends R.A. (Lesson 11) and R.A. is $\Sigma_0$-complete (Lesson 9), P.A. proves every true 
$\Sigma_0$-sentence. 

P1: Suppose X is provable in P.A. In that case $\mathrm{Prov}(\overline{X})$ is true. Since $\mathrm{Prov}(\overline{X})$, i.e., 
$\exists y\,\mathrm{Pf}(\overline{X}, y)$, is true, there is some number n such that $\mathrm{Pf}(\overline{X}, \overline{n})$ is true. Since P.A. 
proves every true $\Sigma_0$-sentence, $\mathrm{Pf}(\overline{X}, \overline{n})$ is provable in P.A. Therefore $\exists y\,\mathrm{Pf}(\overline{X}, y)$, i.e., 
$\mathrm{Prov}(\overline{X})$, is provable in P.A. 

P2: In other words, we want to show that the following is provable in P.A.: 

(i) $(\exists z\,\mathrm{Pf}(\overline{X \to Y}, z) \to (\exists z\,\mathrm{Pf}(\overline{X}, z) \to \exists z\,\mathrm{Pf}(\overline{Y}, z)))$ 

To see that (i) is provable, observe, first of all, that it is true. Where $m\delta$ is the 
numeral that denotes the code of a proof of $X \to Y$ and n is the numeral that denotes 
the code of a proof of X, a numeral that denotes the code of a proof of Y will be 
$mnY\delta$. (For notation, see Lesson 5.) The following $\Sigma_0$-formula expresses the 
relation between the proof of $X \to Y$, the proof of X and the proof of Y. 

$\exists m \le z\,(z = m\delta \land o = mnY\delta)$ 

Abbreviate this as ConP(z, n, o). The following sentences are provable in P.A. (we 
assume without proof): 

$\forall z\forall z'(\mathrm{Pf}(\overline{X \to Y}, z) \to \exists z''\,\mathrm{ConP}(z, z', z''))$ 

$\forall z\forall z'\forall z''((\mathrm{Pf}(\overline{X \to Y}, z) \land \mathrm{Pf}(\overline{X}, z') \land \mathrm{ConP}(z, z', z'')) \to \mathrm{Pf}(\overline{Y}, z''))$. 

From these two sentences, it follows by logic that (i) is provable in P.A. 

Comment on P3: The proof of this requires some instances of the induction scheme 
of P.A. So we cannot get by with appealing to the $\Sigma_0$-completeness of R.A., as we 
did in proving that Prov(x) satisfies P1 and P2. 



The unprovability of consistency 

We will assume that P.A. is consistent. We assume that Prov(x) is the $\Sigma_1$-formula that 
expresses the set of codes of theorems of P.A. 

Let $\bot$ abbreviate some false sentence in the language of arithmetic, such as $\overline{0} = \overline{1}$. If 
P.A. is inconsistent, then every formula in the language of arithmetic is a theorem, 
including $\bot$. So if we can prove that $\bot$ is not provable in P.A., then we will prove that 
P.A. is consistent. So we can think of the statement "$\bot$ is not provable in P.A." as 
asserting the consistency of P.A. Further, since Prov(x) expresses the set of codes of 
formulas provable in P.A., if the sentence $\neg\mathrm{Prov}(\overline{\bot})$ is provable in P.A., then P.A. can 
be said to prove its own consistency. We will see that it does not do that. 

Lemma 1: No fixed point in P.A. for $\neg\mathrm{Prov}(x)$ is provable in P.A. 

Proof: Let G be a fixed point for $\neg\mathrm{Prov}(x)$. So $(G \leftrightarrow \neg\mathrm{Prov}(\overline{G}))$ is provable in 
P.A. Suppose, for a reductio, that G is provable in P.A. Then, since Prov(x) is a 
provability predicate for P.A., it follows, by property P1 of provability predicates, 
that $\mathrm{Prov}(\overline{G})$ is provable in P.A. But since G and $(G \leftrightarrow \neg\mathrm{Prov}(\overline{G}))$ are provable in P.A., 
$\neg\mathrm{Prov}(\overline{G})$ is provable in P.A. But P.A. is consistent. Contradiction! 

Lemma 2: If G is a fixed point for $\neg\mathrm{Prov}(x)$, then $(\neg\mathrm{Prov}(\overline{\bot}) \to G)$ is provable in P.A. 

Proof: Suppose G is a fixed point for $\neg\mathrm{Prov}(x)$. 
So (i) $(G \leftrightarrow \neg\mathrm{Prov}(\overline{G}))$ is provable in P.A. 
Since $\bot$ is refutable in P.A., (ii) $(\neg\mathrm{Prov}(\overline{G}) \leftrightarrow (\mathrm{Prov}(\overline{G}) \to \bot))$ is provable in P.A. 
By (i) and (ii), (iii) $(G \to (\mathrm{Prov}(\overline{G}) \to \bot))$ is provable in P.A. 
By (iii) and property P6 of provability predicates, (iv) $(\mathrm{Prov}(\overline{G}) \to \mathrm{Prov}(\overline{\bot}))$ is 
provable in P.A. 
By (iv), $(\neg\mathrm{Prov}(\overline{\bot}) \to \neg\mathrm{Prov}(\overline{G}))$ is provable in P.A. 
By (i) and (iv), $(\neg\mathrm{Prov}(\overline{\bot}) \to G)$ is provable in P.A. 

Gödel's Second Incompleteness Theorem: The sentence $\neg\mathrm{Prov}(\overline{\bot})$ is not provable in 
P.A. (In other words, P.A. does not prove its own consistency.) 

Proof: P.A. is a consistent extension of R.A. So by the Upper Diagonal Lemma, 
there is a fixed point G for $\neg\mathrm{Prov}(x)$. So: 

$(G \leftrightarrow \neg\mathrm{Prov}(\overline{G}))$ 

is provable in P.A. By Lemma 2, $(\neg\mathrm{Prov}(\overline{\bot}) \to G)$ is provable in P.A. But by 
Lemma 1, G is not provable in P.A. So $\neg\mathrm{Prov}(\overline{\bot})$ is not provable in P.A. 

Note 1: This result does not say that we cannot prove the consistency of Peano 
Arithmetic. All it says is that Peano Arithmetic does not prove its own consistency. We 
can prove the consistency of Peano Arithmetic. We specify a structure for the language 
of arithmetic and then show that all of the axioms of P.A. are true in that structure. Of 
course, in doing this, we will have to take for granted a theory of truth for the language of 
arithmetic, and if the structure we choose is the "intended interpretation" of the language 
of arithmetic, then to show that each of the axioms is true in the structure, we will have to 
take for granted some facts about natural numbers. 

Note 2: The more general statement that can be proved in the same way is that if Th is 
consistent and has a provability predicate and diag is strongly definable in Th, then Th 
does not prove its own consistency. 



Lesson 13: Second-order Logic 



In second-order logic, we have variables that hold the place of predicates and function 
symbols and we can bind those variables with quantifiers. This allows us to "say" things 
that we could not say otherwise. The quantifiers that bind these variables are called 
second-order quantifiers. The languages that contain second-order quantifiers are called 
second-order languages. Second-order logic is the logic of second-order languages (a 
logic I will define below). 

For example, we can state Leibniz's law, the identity of indiscernibles: 

$\forall x\forall y(\forall F(F(x) \leftrightarrow F(y)) \to x = y)$ 

For another example, instead of having infinitely many instances of the Peano induction 
scheme, we can have a single sentence, the second-order Peano induction scheme, 
that says everything that is said by the formulas in that infinite collection of formulas: 

$\forall F(F(0) \to (\forall x(F(x) \to F(x')) \to \forall x F(x)))$ 

Suppose we take this sentence and replace the remaining nonlogical constants with 
appropriate variables and existentially quantify. This gives us: 

(i) $\exists z\exists g\forall F(F(z) \to (\forall x(F(x) \to F(g(x))) \to \forall x F(x)))$ 

This sentence (as well as the second-order Peano induction scheme) is true in a structure 
if and only if the domain of that structure is countable (either finite or denumerably 
infinite). (For proof, see the "axiom of enumerability" in Boolos and Jeffrey, chapter 18.) 
In Lesson 4, I showed you a set of sentences that can all be true only in an infinite 
domain: 

(ii) $\forall x\,\neg Rxx$ 

(iii) $\forall x\exists y\,Rxy$ 

(iv) $\forall x\forall y\forall z((Rxy \land Ryz) \to Rxz)$ 

So the set of sentences containing (i)-(iv) is satisfiable only in structures having a domain 
that is denumerably infinite (countable by (i), infinite by (ii)-(iv)). 

We can even write sentences that violate the Löwenheim-Skolem Theorem. For 
example, the negation of sentence (i) will be true only in domains having a cardinality 
greater than that of the set of natural numbers (nondenumerably infinite). 

Also, there seem to be ordinary sentences of English the meanings of which might best be 
expressed using second-order quantifiers. For example: 

Some critics admire only one another. 

$\exists F(\forall x(F(x) \to x \text{ is a critic}) \land \forall y(\exists z(F(z) \land z \text{ admires } y) \to F(y)))$ 

Now, one question you might have right away is: Why can't we do the same work with 
sets? For example, why couldn't we express the identity of indiscernibles as follows: 

$\forall x\forall y(\forall s(x \in s \leftrightarrow y \in s) \to x = y)$ ? 

Well, maybe for some purposes we could do that (for example for purposes of expressing 
the sentence about critics). But for other purposes we cannot. Compare the following 
two sentences, one of which uses second-order quantification and the other of which uses 
set membership: 

(i) $\exists F\exists x\,F(x)$ 

(ii) $\exists s\exists x\; x \in s$ 

These two sentences fail to be equivalent. We cannot say that they are true in exactly the 
same structures. The reason is that "$\in$" is a vocabulary item that can be differently 
interpreted in different structures. So while (i) is (as we will see) a second-order valid 
sentence, (ii) is not. 

Grammar 

Let's suppose we have a second-order language, $\mathcal{L}^2$. I won't bother to write out the 
definition of well-formed formula or sentence for $\mathcal{L}^2$. I'll assume that you can recognize 
a formula of $\mathcal{L}^2$ well enough by understanding that we can have variables in place of 
function symbols and predicates and have quantifiers that bind those variables. These 
variables will be called predicate variables and function variables. The things that we 
formerly called "predicates" will now be called predicate constants, and the things we 
formerly called "function symbols" will now be called function symbols, as before, or 
function constants. The things that we formerly called "variables" will now be called 
individual variables. 

We will adopt one new notational convention. We will put superscripts on predicate 
and function variables indicating their adicity. So $F^n$ is a variable for which we substitute n-ary 
predicates. And $f^n$ is a variable for which we substitute function symbols for functions 
taking n arguments. 

For simplicity I will go on assuming that the only logical constants are $\neg$, $\to$ and $\forall$ and 
that the other familiar logical constants are used to write abbreviations. 

Semantics 

As for first-order languages, a structure for a second-order language is a pair $\mathfrak{M} = (D, \Sigma)$ 
consisting of a domain D of objects and an assignment $\Sigma$. Just as before, $\Sigma$ assigns to 
each individual constant a member of D, to each n-ary predicate an n-ary relation on 
members of D, and to each n-ary function symbol a function of n arguments on D. 

But now we have to extend the concept of a variable assignment to include assignments 
to predicate and function variables. So a variable assignment in a structure $\mathfrak{M}$ is a 
function g such that: 

(i) for each individual variable v, $g(v) \in D$, and 

(ii) for each predicate variable $F^n$, $g(F^n)$ is a set of n-tuples of members of D, and 

(iii) for each function variable $f^n$, $g(f^n)$ is a function of n arguments having the set of 
n-tuples of members of D as its domain and D as its range. 

We assume that functions are total. That is, each n-ary function yields a value for every 
n-tuple of members of D. 

We also need to have the concept of a variant of a variable assignment that allows 
variations on the assignments to predicate variables and function variables as well as 
variations on the assignments to individual variables. Thus: 

g[v/o] is the variable assignment just like g except that it assigns o to v. 

$g[F^n/R]$ is the variable assignment just like g except that it assigns the set of n-tuples 
R to $F^n$. 

$g[f^n/\mathit{fun}]$ is the variable assignment just like g except that it assigns the function of n 
arguments fun to $f^n$. 

In terms of g and $\Sigma$, we define a term assignment much as before. However, we now 
need a broader definition of term. On the first page of Lesson 5 I gave you a definition of 
term that included terms formed from function symbols. Take that as your definition of 
individual term, but allow "f" in that definition to include function variables. The terms 
are then such individual terms as well as predicate variables and predicate constants. 

Where g is a variable assignment in $\mathfrak{M}$ and $\Sigma$ is an assignment for $\mathfrak{M}$, 
h is a term assignment in $\mathfrak{M}$ if and only if for all terms t of $\mathcal{L}^2$, either: 

(i) t is an individual variable and h(t) = g(t), or 

(ii) t is an individual constant and h(t) = $\Sigma$(t), or 

(iii) t is a function variable and h(t) = g(t), or 

(iv) t is a function constant and h(t) = $\Sigma$(t), or 

(v) for some individual terms $t_1, t_2, \dots, t_n$ and some function variable or constant f, 
and some function fun of n arguments, $t = f(t_1, t_2, \dots, t_n)$, and h(f) = fun and 
$h(t) = \mathit{fun}(h(t_1), h(t_2), \dots, h(t_n))$, or 

(vi) t is a predicate variable and h(t) = g(t), or 

(vii) t is a predicate constant and h(t) = $\Sigma$(t). 

Next, we define satisfaction in a structure by a variable assignment in the usual way: 

(i) Where R is either a predicate variable or predicate constant and $t_1, t_2, \dots, t_n$ are 
individual terms, g satisfies $R(t_1, t_2, \dots, t_n)$ in $\mathfrak{M}$ if and only if 
$\langle h(t_1), h(t_2), \dots, h(t_n)\rangle \in h(R)$. 

(ii) g satisfies a formula $\neg P$ in $\mathfrak{M}$ if and only if g does not satisfy P in $\mathfrak{M}$. 

(iii) g satisfies a formula $(P \to Q)$ in $\mathfrak{M}$ if and only if either g does not satisfy P or g 
satisfies Q. 

(iv) g satisfies a formula $\forall v P$ if and only if for all $o \in D$, g[v/o] satisfies P. 

(v) g satisfies a formula $\forall f^n P$ if and only if for all functions of n arguments fun with 
domain and range in D, $g[f^n/\mathit{fun}]$ satisfies P. 

(vi) g satisfies a formula $\forall F^n P$ if and only if for all sets of n-tuples R of members of D, 
$g[F^n/R]$ satisfies P. 

Existential quantification is understood accordingly. For instance, g satisfies a formula 
$\exists F^n P$ if and only if for some set of n-tuples R of members of D, $g[F^n/R]$ satisfies P. 
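
On a finite domain, the clauses for second-order quantifiers can be checked by brute force, since a monadic predicate variable simply ranges over all subsets of D. The following sketch (with illustrative names `subsets`, `leibniz_law_holds` and `some_property_is_instantiated`, none of which come from the notes) evaluates Leibniz's law and the sentence saying that some property has an instance.

```python
from itertools import chain, combinations


def subsets(domain):
    """All subsets of the domain: the range of a monadic predicate variable."""
    items = list(domain)
    return [frozenset(c) for c in chain.from_iterable(
        combinations(items, r) for r in range(len(items) + 1))]


def leibniz_law_holds(domain) -> bool:
    """Evaluate: for all x, y: (for all F: F(x) <-> F(y)) -> x = y."""
    unary_relations = subsets(domain)
    for x in domain:
        for y in domain:
            indiscernible = all((x in F) == (y in F) for F in unary_relations)
            if indiscernible and x != y:
                return False
    return True


def some_property_is_instantiated(domain) -> bool:
    """Evaluate: there is an F and an x such that F(x)."""
    return any(x in F for F in subsets(domain) for x in domain)


print(leibniz_law_holds({1, 2, 3}))            # True
print(some_property_is_instantiated({"a"}))    # True
```

Leibniz's law comes out true because the singleton {x} is among the values of F and separates x from every other object. This brute-force method is available only for finite domains; for infinite domains the range of F is uncountable.
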

We define truth, thus: A formula P of $\mathcal{L}^2$ is true in a structure $\mathfrak{M}$ if and only if every 
variable assignment in $\mathfrak{M}$ satisfies P. (Notice that this allows a formula that is not a 
sentence to be true.) 

We define logical validity in the usual way: An argument in $\mathcal{L}^2$ is second-order valid if 
and only if for every structure $\mathfrak{M}$ for $\mathcal{L}^2$ and every variable assignment g in $\mathfrak{M}$, if g 
satisfies every premise in $\mathfrak{M}$, then g satisfies the conclusion in $\mathfrak{M}$ as well. If an argument 
in $\mathcal{L}^2$ having premises A and conclusion Q is second-order valid, we write $A \models_2 Q$. 

Similarly, we say that a set of sentences A of $\mathcal{L}^2$ is second-order satisfiable if and only if 
there is a structure $\mathfrak{M}$ and a variable assignment g in $\mathfrak{M}$ that satisfies every member of A 
in $\mathfrak{M}$. 

Noncompactness 

Recall what we mean by compactness. We said (in Lesson 4) that first-order logic is 
compact because for every set of sentences A if every finite subset of A is first-order 
satisfiable (consistent), then A itself is first-order satisfiable. But we find that a similar 
claim cannot be made about second-order logic. (Here I follow the presentation in 
Enderton, p. 271.) 

Observe, to begin, that for every n ≥ 2, we have a first-order sentence $\lambda_n$ which says 
"There are at least n things". For example, $\lambda_3$ is: 

$\exists x\exists y\exists z(x \neq y \land x \neq z \land y \neq z)$ 

If you think about it, you will see that the infinite set of sentences $\{\lambda_2, \lambda_3, \lambda_4, \dots\}$ is 
satisfiable in all and only structures having domains that are at least denumerable in 
cardinality (size). Here is a single sentence of second-order logic that is also satisfiable 
in all and only structures having domains that are at least denumerable: 

$\exists F^2[\forall u\forall v\forall w((F^2uv \land F^2vw) \to F^2uw) \land \forall u\,\neg F^2uu \land \forall u\exists v\,F^2uv]$ 

What this says is that there is a relation which is transitive and irreflexive, and every 
individual stands in that relation to some (other) individual. Here is a simpler sentence 
that also is satisfiable in all and only structures having domains that are at least 
denumerable: 

$\lambda_\infty$: $\exists f^1[\forall u\forall v(f^1(u) = f^1(v) \to u = v) \land \exists w\forall z\, f^1(z) \neq w]$ 

What this says is that there is a function f that never yields the same output for two inputs 
and yet there is one object that is not in its range (a first member of the series). If we 
negate this sentence, the result will be a sentence, $\neg\lambda_\infty$, that is true only in structures 
having finite domains. Consequently, the following set of sentences is not second-order 
satisfiable: 

$A = \{\neg\lambda_\infty, \lambda_2, \lambda_3, \lambda_4, \dots\}$ 

Consider any finite subset B of A. That subset may contain $\neg\lambda_\infty$, but there will be some 
largest m such that the subset contains $\lambda_m$. So every member of B will be true in a 
structure having a finite domain containing m or more members. So every finite subset of 
A is satisfiable, but A is not itself satisfiable. 
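
The finite half of this argument can be checked mechanically for small domains. The sketch below is illustrative only: the names `lambda_n`, `lambda_inf` and `finite_subset_satisfied` are assumptions, and the function quantifier is evaluated by enumerating every total function from a finite domain to itself.

```python
from itertools import product


def lambda_n(domain, n: int) -> bool:
    """'There are at least n things.'"""
    return len(set(domain)) >= n


def lambda_inf(domain) -> bool:
    """Is there a one-to-one f on the domain that misses some object w?"""
    elems = list(domain)
    for values in product(elems, repeat=len(elems)):   # every total f: D -> D
        f = dict(zip(elems, values))
        injective = all(f[u] != f[v]
                        for u in elems for v in elems if u != v)
        misses_something = any(all(f[z] != w for z in elems) for w in elems)
        if injective and misses_something:
            return True
    return False


def finite_subset_satisfied(m: int) -> bool:
    """Is {not-lambda_inf, lambda_2, ..., lambda_m} true in a domain of size m?"""
    domain = range(m)
    return (not lambda_inf(domain)) and all(
        lambda_n(domain, n) for n in range(2, m + 1))


print(all(finite_subset_satisfied(m) for m in range(2, 5)))   # True
```

No finite check can exhibit the other half of the argument, that the whole set A is unsatisfiable, because that depends on every $\lambda_n$ holding at once, which forces an infinite domain.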

So second-order logic is not compact. We cannot say that for every set A of second-order 
formulas, if every finite subset is satisfiable, then A itself is satisfiable. (When we 
discussed compactness in Lesson 4, we were thinking of satisfiability as a property of 
sets of sentences. Now we are thinking of it as property of sets of formulas.) 

Recall that we had a second formulation of compactness. We could say of first-order 
logic that for every first-order sentence Q and every set of first-order sentences A, if 
$A \models Q$, then there is some finite subset B of A such that $B \models Q$. 

We cannot say this of second-order logic. Let $A = \{\lambda_2, \lambda_3, \lambda_4, \dots\}$. Obviously, $A \models_2 \lambda_\infty$, 
because, as I said, A and $\lambda_\infty$ are both satisfiable in exactly the structures with domains that 
are at least denumerable. But if B is a finite subset of A, then there will be a structure 
having a finite domain in which every member of B is true, and $\lambda_\infty$ will be false in that 
domain. 

Categoricity 

The Löwenheim-Skolem theorems of Lesson 4 (downward and upward) show that there 
are definite limits to the extent to which a set of sentences of a first-order language can 
fix the size of the domains of the structures that satisfy it. In second-order logic, by 
contrast, we can have theories that fix the cardinality of the domains that satisfy them, 
even when those domains are infinite. Indeed, we can write theories in second-order 
logic that are satisfiable only in infinite domains but which are such that any two 
structures that satisfy them are isomorphic to one another. (See Lesson 4 for the 
definition of isomorphism.) When a theory has this property (all structures that satisfy it 
are isomorphic to one another), we say that it is categorical. 

Let $\mathcal{L}^2_A$ be the language of second-order arithmetic (just like the language of arithmetic, 
except that it includes second-order variables and quantifiers). Let second-order Peano 
arithmetic, $\mathrm{PA}^2$, be the conjunction of the following two sentences of $\mathcal{L}^2_A$: the universal 
closure of the conjunction of the nine axioms of Q (see Lesson 11), and the second-order 
induction scheme: $\forall F(F(0) \to (\forall x(F(x) \to F(x')) \to \forall x F(x)))$. Obviously $\mathrm{PA}^2$ is true on 
the intended interpretation, viz., the structure having as its domain the set of natural 
numbers and which assigns zero to 0, assigns the successor function to ', assigns the 
addition function to +, and assigns the multiplication function to •. It can be proved that 
every structure that satisfies $\mathrm{PA}^2$ is isomorphic to the intended interpretation. (For a 
proof, see Boolos and Jeffrey, chapter 18, "Second-order logic," or Shapiro, pp. 82-83. 
Actually, the proof does not even need N7-N9.) 

Axiomatizability 

The question to be considered next is whether second-order logic is axiomatizable. We 
can take this question in two versions. 

First version: Can we write a finite set of axiom schemata and finite inference rules such 
that all and only second-order valid arguments are provable using axioms and inference 
rules of those kinds? 

Let us be a little more precise. I will assume that you know what it means to say that a 
sentence has a certain form. I will assume that you know what it means to say that a 
subproof has a certain form. I will say that an inference rule is any rule that tells us that 
given finitely many premises having certain forms and finitely many subproofs having 
certain forms, we may derive a conclusion having a certain form. (An axiom scheme 
may be treated as the special case of an inference rule that says that a conclusion of a 
certain form may be derived from no premises and subproofs at all.) Further, I will 
assume that you know what a proof is, defined in terms of such inference rules. For the 
definition, see Lesson 2. (That definition pertained only to our Barwise and Etchemendy 
inference rules, but the definition can be generalized.) 

In these terms the question can be put this way: Is there a set of inference rules for $\mathcal{L}^2$ 
such that: for every argument in $\mathcal{L}^2$ there is a proof in this sense of the conclusion from 
the set of premises using these rules if and only if the argument is second-order valid? In 
other words, can we have a set of inference rules that is sound and complete with respect 
to second-order validity? 

In view of the noncompactness of second-order logic, the answer to this question is 
clearly no. Suppose, for a reductio, that the answer is yes. Then there is a set of 
inference rules such that for any set of sentences A of $\mathcal{L}^2$ and any sentence Q of $\mathcal{L}^2$, 
$A \models_2 Q$ if and only if there is a proof of Q from A using only those inference rules; in 
symbols: $A \vdash_2 Q$. We show that, contrary to what we have seen in the discussion of 
compactness, if $A \models_2 Q$, then there is a finite subset B of A such that $B \models_2 Q$. Suppose 
$A \models_2 Q$. By hypothesis, there are proofs for all second-order valid arguments; so 
$A \vdash_2 Q$. But proofs are finite. So at most finitely many members of A are used in 
constructing a proof of Q from A. So there is a finite subset B of A such that $B \vdash_2 Q$. By 
hypothesis, there are proofs only for second-order valid arguments; so $B \models_2 Q$. 

But what is the significance of the fact that there is a set of inference rules such that there 
is a proof for an argument using those inference rules if and only if the argument is valid? 
From one point of view, the significance of that fact is only that if there is such a 
decidable system of inference rules then the set of valid arguments (having finite sets of 
premises) can be effectively enumerated. (If there is such a system of inference rules, 
then we will be able to show that the set of codes of valid arguments is $\Sigma_1$.) But in 
principle there could be ways of enumerating the valid arguments (with finite sets of 
premises) other than one that rests on there being such a decidable set of inference rules. 
So a question of greater interest is this: 

Second version: Is the set of second-order valid sentences of ℒ²_A effectively
enumerable?

Suppose that Q is a sentence of ℒ²_A that is true in the intended interpretation of ℒ²_A. Then
Q is true in every structure for ℒ²_A that is isomorphic to the intended interpretation (see
Lesson 4 for the idea of the proof). So since PA² is true on the intended interpretation
and, as we have seen, PA² is categorical, PA² ⊨₂ Q. Thus, every true sentence of
arithmetic is a semantic consequence of PA². Given this fact, we can prove that the set of
second-order valid sentences of ℒ²_A is not effectively enumerable.

Suppose, for a reductio, that the second-order valid sentences of ℒ²_A can be effectively
enumerated. If Q is a first-order sentence of ℒ²_A, then either Q or ¬Q is true on the
intended interpretation. So, by what we have just stated, either PA² ⊨₂ Q or PA² ⊨₂
¬Q. So either (PA² → Q) or (PA² → ¬Q) is second-order valid. So if the set of
second-order valid sentences of ℒ²_A were effectively enumerable, either (PA² → Q) or
(PA² → ¬Q) would eventually show up in the enumeration. If (PA² → Q) shows up, then we
will know that Q is true. If (PA² → ¬Q) shows up, then we will know that Q is not true.
So the set of first-order truths of arithmetic would be decidable. But since the set of
first-order truths of arithmetic is not arithmetic (by Tarski's undefinability theorem, which
still holds), and therefore not recursive, we know, by Church's thesis, that the set of
first-order truths of arithmetic is not decidable. So the set of second-order valid sentences of
ℒ²_A cannot be effectively enumerated. Consequently, the set of second-order valid
sentences of any other second-order language at least as rich in predicate and function
symbols as ℒ²_A cannot be effectively enumerated either.

Exercise: Suppose we have a Gödel numbering for the set of expressions of the language
of second-order arithmetic, ℒ²_A. We can use the formulas of the language of first-order
arithmetic, ℒ_A, to express sets of these Gödel numbers. We have also learned (though we
did not prove it) that every first-order truth of arithmetic is a second-order consequence
of PA². Use these facts to show that the set of codes of second-order valid formulas of
ℒ²_A is not arithmetic. Hints: Modify the proof of the undecidability of first-order logic,
Lesson 11, as needed. Notice that PA² is a particular sentence of ℒ²_A and so has a
particular code. Recall that the set of codes of first-order truths of arithmetic is not
arithmetic (by Tarski's Undefinability Theorem).

Modal logic is the study of the logic of necessity and possibility. The symbol that means
"It is necessary that. . ." is □ (the box), and the symbol that means "It is possible that. . ."
is ◇ (the diamond). These two symbols will be called modal operators (although strictly
speaking we should call them modal connectives). We begin with propositional modal
logic, which is the study of languages containing modal operators and other sentential
connectives but no quantifiers.

First, we will define a language of propositional modal logic. Then we will define the 
conditions under which a sentence of the language is true in a modal structure. Finally, 
we will define several different concepts of logical validity. What we understand a 
modal operator to mean will depend on which definition of logical validity we accept. 

The Grammar 

Let the atomic sentences of TM be A, B, C, . . . Say that P is a sentence of TM if and
only if either (i) P is an atomic sentence of TM, or (ii) P = ¬Q and Q is a sentence of
TM, or (iii) P = (Q → R) and both Q and R are sentences of TM, or (iv) P = □Q and
Q is a sentence of TM.

We will treat ◇P as an abbreviation of ¬□¬P.

Truth Conditions

A frame (or Kripke-frame) is a pair (W, R), where W is a nonempty set of possible
worlds, and R is a binary relation on members of W (i.e., a set of pairs of members of W).

A valuation V in a frame (W, R) is a function that takes each pair consisting of an atomic 
sentence of TM and a world in W as arguments and yields either the truth value T or the 
truth value F as output. 

For example, we might have a structure where W = {w₁, w₂, w₃}, R = {(w₁, w₁), (w₁, w₂),
(w₂, w₃), (w₃, w₁)}, and V(A, w₁) = T, V(A, w₂) = F, V(A, w₃) = T, . . .

R is called an accessibility relation, and if (wᵢ, wⱼ) ∈ R, then we say that wⱼ is accessible
from wᵢ. We abbreviate (wᵢ, wⱼ) ∈ R thus: wᵢRwⱼ.

For emphasis: If wᵢRwⱼ, then wⱼ is accessible from wᵢ.

We define the conditions under which sentences of TM are true at a world w on a
valuation V in a frame (W, R) as follows:

(i) If P is atomic, then P is true at w ∈ W on V in (W, R) if and only if V(P, w) = T.

(ii) If P = ¬Q, then P is true at w ∈ W on V in (W, R) if and only if Q is not true at w ∈
W on V in (W, R).

(iii) If P = (Q → R), then P is true at w ∈ W on V in (W, R) if and only if either Q is not
true at w ∈ W on V in (W, R) or R is true at w ∈ W on V in (W, R).

(iv) If P = □Q, then P is true at w ∈ W on V in (W, R) if and only if for all w′ ∈ W, if
wRw′, then Q is true at w′ ∈ W on V in (W, R).

A consequence is that: 

(v) If P = ◇Q, then P is true at w on V in (W, R) if and only if for some w′ ∈ W, wRw′
and Q is true at w′ on V in (W, R).

Proof: ◇Q abbreviates ¬□¬Q, and, by (ii) and (iv), ¬□¬Q is true at w on V in (W, R) if
and only if for some w′ ∈ W, wRw′ and Q is true at w′ on V in (W, R).

A sentence is false at a world on a valuation in a frame if and only if it is not true at that 
world on that valuation in that frame. 
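
The truth conditions can be turned into a small recursive evaluator. What follows is only a
sketch in Python under assumptions of my own: sentences are encoded as nested tuples,
worlds are strings, V is a dictionary from (atomic sentence, world) pairs to truth values,
and the names true_at and dia are hypothetical. The structure evaluated is the three-world
example given earlier.

```python
# Sentences are nested tuples (an encoding assumed here for illustration):
# an atomic sentence is a string such as "A"; ("not", Q), ("imp", Q, R) and
# ("box", Q) mirror clauses (ii)-(iv) of the definition of truth.

def dia(q):
    """The diamond treated as an abbreviation: possibly-Q is not-box-not-Q."""
    return ("not", ("box", ("not", q)))

def true_at(p, w, V, R):
    """Return True iff sentence p is true at world w on valuation V, given R."""
    if isinstance(p, str):                      # clause (i): atomic sentences
        return V[(p, w)]
    op = p[0]
    if op == "not":                             # clause (ii)
        return not true_at(p[1], w, V, R)
    if op == "imp":                             # clause (iii)
        return (not true_at(p[1], w, V, R)) or true_at(p[2], w, V, R)
    if op == "box":                             # clause (iv): all accessible worlds
        return all(true_at(p[1], w2, V, R) for (w1, w2) in R if w1 == w)
    raise ValueError(f"not a sentence: {p!r}")

# The example structure: three worlds, A true at w1 and w3, false at w2.
R = {("w1", "w1"), ("w1", "w2"), ("w2", "w3"), ("w3", "w1")}
V = {("A", "w1"): True, ("A", "w2"): False, ("A", "w3"): True}

print(true_at(("box", "A"), "w1", V, R))   # False: w2 is accessible and A fails there
print(true_at(dia("A"), "w1", V, R))       # True: w1 is accessible from itself
print(true_at(("box", "A"), "w2", V, R))   # True: only w3 is accessible from w2
```

Because the diamond is defined through the box and negation, clause (v) needs no separate
case in the evaluator.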

If we suppose that one of the worlds in W is the actual world (the world that really
exists), then we can say that A is true (simpliciter) if and only if A is true at the actual 
world. 

(From now on, I will just assume that the worlds we're talking about are in W in the
frame we're talking about.) 

For example:

[Figure: a frame with four worlds, w₁, w₂, w₃ and w₄, with arrows representing R and the
atomic sentences true at each world.]

W = {w₁, w₂, w₃, w₄}, and R is represented by the arrows. ◇A is true at w₁ in this frame,
because A is true at w₂ in this frame, and w₁Rw₂. □A is false at w₁ in this frame, because
A is false at w₃ in this frame, and w₁Rw₃. However, □B is true at w₁ in this frame,
because B is true at every world w ∈ W such that w₁Rw. (B is false at w₄, but it does not
matter, since w₄ is not accessible from w₁. We are not assuming, at this point, that
accessibility is transitive.)

Kinds of Frame 

Towards defining different kinds of validity, we define different kinds of frame: 

A frame (W, R) is reflexive if and only if for all w ∈ W, wRw. (Every world is accessible
from itself.)

A frame (W, R) is symmetric if and only if for all w, w′ ∈ W, if wRw′ then w′Rw. (If w′ is
accessible from w, then w is accessible from w′.)

A frame (W, R) is transitive if and only if for all w, w′, w″ ∈ W, if wRw′ and w′Rw″, then
wRw″. (If w″ is accessible from w′ and w′ is accessible from w, then w″ is accessible
from w.)

A frame is a T-frame if and only if it is reflexive.

A frame is a B-frame if and only if it is reflexive and symmetric.

A frame is an S4-frame if and only if it is reflexive and transitive.

A frame is an S5-frame if and only if it is reflexive, symmetric and transitive.
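
For finite frames these definitions can be checked mechanically. The following Python
sketch assumes that a frame is given as a set W of worlds and a set R of ordered pairs;
the function names are hypothetical and chosen only for illustration.

```python
def is_reflexive(W, R):
    """Every world is accessible from itself."""
    return all((w, w) in R for w in W)

def is_symmetric(W, R):
    """Whenever wRw' holds, w'Rw holds too."""
    return all((v, u) in R for (u, v) in R)

def is_transitive(W, R):
    """Whenever wRw' and w'Rw'' hold, wRw'' holds too."""
    return all((u, x) in R for (u, v) in R for (y, x) in R if v == y)

def frame_kinds(W, R):
    """List which of the named kinds (T, B, S4, S5) the frame (W, R) belongs to."""
    refl = is_reflexive(W, R)
    sym = is_symmetric(W, R)
    trans = is_transitive(W, R)
    kinds = []
    if refl:
        kinds.append("T")
    if refl and sym:
        kinds.append("B")
    if refl and trans:
        kinds.append("S4")
    if refl and sym and trans:
        kinds.append("S5")
    return kinds

# A reflexive, transitive, non-symmetric frame on two worlds.
print(frame_kinds({1, 2}, {(1, 1), (2, 2), (1, 2)}))            # ['T', 'S4']
# A frame in which every world is accessible from every world.
print(frame_kinds({1, 2}, {(1, 1), (2, 2), (1, 2), (2, 1)}))    # ['T', 'B', 'S4', 'S5']
```

Note that every S5-frame is also a T-frame, a B-frame and an S4-frame, which is why the
second call reports all four kinds.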

Note: In an S5-frame, W can be divided into mutually exclusive cells such that W is the
union of all of the cells and such that for each cell C ⊆ W, for all w, w′ ∈ C, wRw′. In
other words, if w₁, w₂ ∈ W and w₁Rw₂, then for all w₃ such that w₁Rw₃ and for all w₄ such
that w₂Rw₄, w₃Rw₄, and if w₁, w₂ ∈ W and ¬w₁Rw₂, then for all w₃ such that w₁Rw₃ and
for all w₄ such that w₂Rw₄, ¬w₃Rw₄.

Kinds of Validity

Let A be a set of sentences of TM and Q be a sentence of TM. 

The argument having the sentences in A as premises and the sentence Q as its conclusion
is K-valid (A ⊨_K Q) if and only if for every frame (W, R) and every valuation V in (W, R)
and every world w ∈ W, if every member of A is true at w on V in (W, R), then Q is true
at w on V in (W, R).

The argument having the sentences in A as premises and the sentence Q as its conclusion
is T-valid (A ⊨_T Q) if and only if for every T-frame (W, R) (reflexive) and every
valuation V in (W, R) and every world w ∈ W, if every member of A is true at w on V in
(W, R), then Q is true at w on V in (W, R).

The argument having the sentences in A as premises and the sentence Q as its conclusion
is B-valid (A ⊨_B Q) if and only if for every B-frame (W, R) (reflexive and symmetric) and
every valuation V in (W, R) and every world w ∈ W, if every member of A is true at w on
V in (W, R), then Q is true at w on V in (W, R).

The argument having the sentences in A as premises and the sentence Q as its conclusion
is S4-valid (A ⊨_S4 Q) if and only if for every S4-frame (W, R) (reflexive and transitive)
and every valuation V in (W, R) and every world w ∈ W, if every member of A is true at w
on V in (W, R), then Q is true at w on V in (W, R).

The argument having the sentences in A as premises and the sentence Q as its conclusion
is S5-valid (A ⊨_S5 Q) if and only if for every S5-frame (W, R) (reflexive and symmetric
and transitive) and every valuation V in (W, R) and every world w ∈ W, if every member
of A is true at w on V in (W, R), then Q is true at w on V in (W, R).
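
Each of these definitions quantifies over all frames of a given kind, so an argument is
shown invalid by exhibiting a single frame, valuation and world. The Python sketch below
searches small frames for such a countermodel. It rests on assumptions of my own: the
nested-tuple encoding of sentences, the hypothetical function names, and a bound on the
number of worlds. Finding a countermodel establishes invalidity; finding none within the
bound does not by itself establish validity.

```python
from itertools import combinations, product

def true_at(p, w, V, R):
    """Truth at a world: atoms are strings; ("not", Q), ("imp", Q, R), ("box", Q)."""
    if isinstance(p, str):
        return V[(p, w)]
    if p[0] == "not":
        return not true_at(p[1], w, V, R)
    if p[0] == "imp":
        return (not true_at(p[1], w, V, R)) or true_at(p[2], w, V, R)
    if p[0] == "box":
        return all(true_at(p[1], w2, V, R) for (w1, w2) in R if w1 == w)
    raise ValueError(f"not a sentence: {p!r}")

def atoms(p):
    """The set of atomic sentences occurring in p."""
    return {p} if isinstance(p, str) else set().union(*(atoms(q) for q in p[1:]))

def countermodel(premises, conclusion, max_worlds=2, frame_ok=lambda W, R: True):
    """Search frames with at most max_worlds worlds (restricted by frame_ok) for a
    world at which every premise is true and the conclusion is not."""
    letters = sorted(set().union(atoms(conclusion), *(atoms(p) for p in premises)))
    for n in range(1, max_worlds + 1):
        W = list(range(n))
        pairs = [(u, v) for u in W for v in W]
        for k in range(len(pairs) + 1):
            for R in map(set, combinations(pairs, k)):
                if not frame_ok(W, R):
                    continue
                keys = [(a, w) for a in letters for w in W]
                for vals in product([True, False], repeat=len(keys)):
                    V = dict(zip(keys, vals))
                    for w in W:
                        if (all(true_at(p, w, V, R) for p in premises)
                                and not true_at(conclusion, w, V, R)):
                            return W, R, V, w
    return None

def reflexive(W, R):
    return all((w, w) in R for w in W)

# With no restriction on frames, a countermodel to "box A, therefore A" exists.
print(countermodel([("box", "A")], "A") is not None)
# Restricted to reflexive frames (T-frames), the bounded search finds none.
print(countermodel([("box", "A")], "A", frame_ok=reflexive) is None)
```

Passing a different frame_ok test (for instance, one requiring symmetry or transitivity as
well) restricts the search to the frames relevant to the other validity concepts.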

Proof theory 

Corresponding to each of these validity concepts, we specify a set of axioms such that all 
and only the arguments that are valid in that sense are provable using those axioms and 
Modus Ponens. 

At the core of each set of axioms are the axioms of PL from Lesson 5, which I repeat
here:

The axiom system PL (for TM):

L1: (P → (Q → P))
L2: ((P → (Q → R)) → ((P → Q) → (P → R)))
L3: ((¬P → ¬Q) → (Q → P))

Every sentence of TM that has one of these three forms will be an axiom of PL (for 
TM). We could just as well say: Every tt-valid sentence of TM is an axiom, because 
for any language tt-validity is decidable. In any case, we will assume that every tt-valid 
sentence can be derived from axioms of these three forms by means of repeated 
applications of Modus Ponens. 

The System K 

The system K adds to the axioms of PL one axiom scheme and one inference rule. The 
axiom scheme is: 

L4: (□(P → Q) → (□P → □Q))

This is called the characteristic axiom scheme of K. The additional inference rule is: 

Necessitation: If P can be derived from axioms only (no special premises), then
infer □P.

When there exists a proof of Q using only sentences in set A and axioms of any of the
forms L1-L4, Modus Ponens and Necessitation, we write A ⊢_K Q.

We say that K = PL + L4 + Necessitation.

As a matter of fact, A ⊢_K Q if and only if A ⊨_K Q, but we won't prove that. In other
words, proof in K is sound and complete with respect to K-validity.

Exercise 1: Show that □(P → Q) ⊨_K (□P → □Q). (Hint: Suppose that the premise is
true at some world on some valuation in some K-frame and that the conclusion is false at
that world on that valuation in that K-frame, and then derive a contradiction.)

Exercise 2: Show that □P ⊭_K P. (So the box does not yet behave very much like a
necessity operator.)

Solution: Let W = {w₁, w₂}. Suppose that R = {(w₁, w₂)}. (Note that R is not
reflexive.) Suppose V(A, w₁) = F, but V(A, w₂) = T. Since for every w ∈ W such that
w₁Rw (namely, w₂ only) A is true at w, □A is true at w₁. But A is not true at w₁.

The System T 

The system T adds the following axiom scheme: 

L5: (□P → P)

This is the characteristic axiom scheme of T. So T = K + L5.

As a matter of fact, A ⊢_T Q if and only if A ⊨_T Q, but we won't prove that.

Exercise 3: Prove that □P ⊨_T P.

Solution: Let (W, R) be a T-frame, and suppose that V is a valuation such that □P is
true at w on V in (W, R). Then, by condition (iv) in the definition of truth, for every
w′ ∈ W such that wRw′, P is true at w′ on V in (W, R). But since R is reflexive in
every T-frame, wRw. So P is true at w on V in (W, R).

Exercise 4: Prove that P ⊭_T □◇P.

Solution: Suppose W = {w₁, w₂} and R = {(w₁, w₁), (w₂, w₂), (w₁, w₂)}. (W, R) is a
T-frame since R is reflexive. (But note that R is not symmetric.) Suppose V(A, w₁) = T
and V(A, w₂) = F. A is true at w₁ on V in (W, R). But the only world accessible from
w₂ is w₂ itself, and A is false at w₂. So ◇A is false at w₂ on V in (W, R). But w₁Rw₂,
so we cannot say that for all w ∈ W, if w₁Rw, then ◇A is true at w on V in (W, R). So
□◇A is not true at w₁ on V in (W, R).

Exercise 5: Prove that □P ⊭_T □□P.

The System B 

The system B adds to the system T the following axiom scheme:

L6: (P → □◇P)

This is the characteristic axiom scheme of B. So B = T + L6.

As a matter of fact, A ⊢_B Q if and only if A ⊨_B Q, but we won't prove that.

Exercise 6: Prove that P ⊨_B □◇P.

Solution: Suppose that P is true at w on V in a B-frame (W, R). Consider an arbitrary
world w′ ∈ W such that wRw′. Since (W, R) is symmetric, w′Rw. So, since P is true
at w, ◇P is true at w′. So for every w′ ∈ W such that wRw′, ◇P is true at w′. So □◇P
is true at w.

Exercise 7: Prove that □P ⊭_B □□P.

Solution: Let W = {w₁, w₂, w₃} and R = {(w₁, w₁), (w₂, w₂), (w₃, w₃), (w₁, w₂), (w₂,
w₁), (w₂, w₃), (w₃, w₂)}. So (W, R) is a B-frame, since it is reflexive and symmetric.
(But note that (W, R) is not transitive.) Suppose V(A, w₁) = T, and V(A, w₂) = T, but
V(A, w₃) = F. Since only w₁ and w₂ are accessible from w₁, □A is true at w₁. But
since w₂Rw₃ and A is not true at w₃, □A is not true at w₂. So since w₁Rw₂ and □A is
not true at w₂, □□A is not true at w₁.



The System S4 

The system S4 adds to the system T the following axiom scheme:

L7: (□P → □□P)

This is the characteristic axiom scheme of S4. So S4 = T + L7. Notice that S4 builds on
T, not B, and does not include L6.

As a matter of fact, A ⊢_S4 Q if and only if A ⊨_S4 Q, but we won't prove that.

Exercise 8: Prove that □P ⊨_S4 □□P.

Solution: Suppose □□P is false at w on V in an S4-frame (W, R). So there is a
world w′ ∈ W such that wRw′ and □P is false at w′. So there is a world w″ such that
w′Rw″ and P is false at w″. But (W, R) is transitive. So, since wRw′ and w′Rw″,
wRw″. So □P is false at w.

Exercise 9: Prove that P ⊭_S4 □◇P.

Solution: See the proof that P ⊭_T □◇P. (The frame used there is transitive as well as
reflexive, so it is an S4-frame.)

The System S5 

The system S5 adds to the system T both the characteristic axiom of B and the
characteristic axiom of S4:

L6: (P → □◇P)
L7: (□P → □□P)

So S5 = T + L6 + L7 = B + L7 = S4 + L6.

As a matter of fact, A ⊢_S5 Q if and only if A ⊨_S5 Q, but we won't prove that.

The Reducibility of Modalities in S5 

Let P =||= Q mean: P ⊨_S5 Q and Q ⊨_S5 P.

Theorem (the reducibility of modalities in S5):

(i) ◇P =||= □◇P

(ii) □P =||= ◇□P

(iii) ◇P =||= ◇◇P

(iv) □P =||= □□P

Before we prove this, let us contemplate what it means. It means that whenever we have 
a sentence that begins with a string of modal operators, we can find an equivalent formula 
that begins with just the last of those operators. For example:

□◇◇□P =||= ◇◇□P =||= ◇□P =||= □P

Now the proof:

(i) R-L: By exercise 3, □Q ⊨_S5 Q. So □◇P ⊨_S5 ◇P.

L-R: Suppose that ◇P is true at w. Then there is a world w′ accessible
from w where P is true. But w′ is accessible from every world accessible
from w. So at every world accessible from w, ◇P is true. So □◇P is true at
w.

(ii) From (i) we have ¬◇¬Q =||= ¬□◇¬Q. Applying the definition of ◇, this
gives us □Q =||= ◇□Q.

(iii) R-L: Suppose ◇◇P is true at w. So ◇P is true at some world w′ accessible from w.
So P is true at some world w″ accessible from w′. But w″ is
accessible from w. So ◇P is true at w.

L-R: By exercise 3, □¬Q ⊨_S5 ¬Q. So ¬¬Q ⊨_S5 ¬□¬Q. So Q ⊨_S5 ◇Q. So
◇P ⊨_S5 ◇◇P.

(iv) From (iii) we have ¬◇¬Q =||= ¬◇◇¬Q. Applying the definition of ◇, this
gives us □Q =||= □□Q.
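
The theorem can also be spot-checked by brute force on small S5-frames. The Python sketch
below is a sanity check and not a substitute for the proof: it assumes the nested-tuple
encoding of sentences, a single atomic sentence A, hypothetical function names, and frames
with at most three worlds.

```python
from itertools import combinations, product

def true_at(p, w, V, R):
    """Truth at a world: atoms are strings; ("not", Q), ("imp", Q, R), ("box", Q)."""
    if isinstance(p, str):
        return V[(p, w)]
    if p[0] == "not":
        return not true_at(p[1], w, V, R)
    if p[0] == "imp":
        return (not true_at(p[1], w, V, R)) or true_at(p[2], w, V, R)
    if p[0] == "box":
        return all(true_at(p[1], w2, V, R) for (w1, w2) in R if w1 == w)
    raise ValueError(f"not a sentence: {p!r}")

def box(q):
    return ("box", q)

def dia(q):
    """The diamond as an abbreviation of not-box-not."""
    return ("not", ("box", ("not", q)))

def is_s5(W, R):
    """Reflexive, symmetric and transitive."""
    refl = all((w, w) in R for w in W)
    sym = all((v, u) in R for (u, v) in R)
    trans = all((u, x) in R for (u, v) in R for (y, x) in R if v == y)
    return refl and sym and trans

def equivalent_on_small_s5(p, q, max_worlds=3):
    """True iff p and q have the same truth value at every world of every S5-frame
    with at most max_worlds worlds, on every valuation of the atom "A"."""
    for n in range(1, max_worlds + 1):
        W = list(range(n))
        pairs = [(u, v) for u in W for v in W]
        for k in range(len(pairs) + 1):
            for R in map(set, combinations(pairs, k)):
                if not is_s5(W, R):
                    continue
                for vals in product([True, False], repeat=n):
                    V = {("A", w): vals[w] for w in W}
                    if any(true_at(p, w, V, R) != true_at(q, w, V, R) for w in W):
                        return False
    return True

print(equivalent_on_small_s5(dia("A"), box(dia("A"))))    # clause (i)
print(equivalent_on_small_s5(box("A"), dia(box("A"))))    # clause (ii)
print(equivalent_on_small_s5(dia("A"), dia(dia("A"))))    # clause (iii)
print(equivalent_on_small_s5(box("A"), box(box("A"))))    # clause (iv)
print(equivalent_on_small_s5(box("A"), "A"))              # a non-equivalence, for contrast
```

Two sentences agree at every world of every structure just in case each is a consequence
of the other, which is why agreement of truth values is the right thing to test here.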



Some Objections 

The contemporary literature on propositional modal logic recognizes nothing the least bit 
controversial. (As we will see, that is not the case when it comes to quantified modal 
logic.) Although I may be the only one, I think that actually none of these logics captures 
the logic of natural language modal operators. That is because what we really need is a 
three-valued semantics.

First, let's compare a two-valued semantics to a three-valued semantics with respect to a 
couple of points: 

Inconsistency and implication: In a two-valued semantics, if P and Q are inconsistent,
then P implies ¬Q. If there is no valuation on which both P and Q are true, then for
every valuation on which P is true ¬Q is true too. But in a three-valued semantics, this
does not hold. There might be no valuation on which P and Q are both true, and yet
there may be a valuation on which P is true and ¬Q is neither true nor false.

Contraposition: A two-valued semantics obeys the law of contraposition: If P implies
Q, then ¬Q implies ¬P. (We made tacit use of this in the proof of the reducibility of
modalities in S5.) But a three-valued semantics need not obey this law. Suppose P
implies Q, because on every interpretation on which P is true, Q is true. Then on every
interpretation on which Q is not true, P is not true. But every interpretation on which
¬Q is true is an interpretation on which Q is not true. So every interpretation on which
¬Q is true is an interpretation on which P is not true. But it does not follow that every
interpretation on which ¬Q is true is an interpretation on which ¬P is true, because P
could fail to be true by being neither true nor false, in which case ¬P is neither true nor
false as well.

First problem case: 

Suppose I step outside and declare, "It's going to rain!" But then I look up at the sky,
reconsider and say, "Well, it might not". Doesn't this count as taking back what I said in
the first place? If so, then there is a kind of inconsistency between P and ◇¬P. But in a
two-valued logic, if P and Q are inconsistent, then P implies ¬Q. We don't want to say
that P implies ¬◇¬P, because that means that P implies □P, which is certainly not the
case. So to maintain that P and ◇¬P are inconsistent, we have to go to a three-valued
semantics.

Second problem case: 

If it is possible that it is possible that I will not go for a walk, then surely it is possible
that I will not go for a walk. But suppose that it is necessary that I will go for a walk. May
I infer that it is necessary that it be necessary that I will go for a walk? No! The reason
why it is necessary that I go for a walk may be that I decided to go for a walk, and I always
do what I have decided to do. But it is not necessary that it be necessary that I go for a
walk, because I did not have to decide to go for a walk. So ◇◇¬P implies ◇¬P, but □P
does not imply □□P. But since the principle of contraposition holds in a two-valued
semantics, the only way we can have the one implication without the other is by going to
a three-valued semantics.

Third problem case: 

I will be in Cincinnati on Friday. Therefore, it is necessarily possible that I will be in
Cincinnati on Friday. But it may be necessary that I not be in Cincinnati on Monday.
Does it follow that I will not be in Cincinnati on Monday? I think not. In other words, P
implies □◇P, but ◇□¬P does not imply ¬P. But here too, since the principle of
contraposition holds in a two-valued semantics, the only way we can have the one
implication without the other is by going to a three-valued semantics.



Lesson 15: Quantified Modal Logic 



Here's our agenda: First we will look at the "obvious" way of interpreting sentences 
containing both quantifiers and modal operators. Then we will look at some reasons to 
be dissatisfied with that. Then we will investigate two alternatives (the "free-logical" 
alternative, and the "existence predicate" alternative). Finally, we will draw the 
distinction between rigid and non-rigid designation. We will concern ourselves 
exclusively with questions of semantics and will not deal with proof theory (axiom 
systems) at all. 

Throughout we will suppose that we have a language QJM, which, with respect to
grammar, is just like a first-order language except that it contains the box, □, which
behaves grammatically just like the negation sign. We will dispense with function
symbols for present purposes.

Simple Quantified Modal Logic 

In Simple Quantified Modal Logic (a term I just made up), we simply add to each frame a
set of objects, the domain. So a frame is now a triple, (D, W, R), where D is a nonempty
set of objects, and, as before, W is a nonempty set of worlds and R is an accessibility
relation on the worlds in W. For simplicity I will assume that for every frame (D, W, R),
(W, R) is an S5-frame (and therefore reflexive, symmetric and transitive). (This means
that we could drop all mention of R and just assume that all worlds in W were accessible
to one another.)

We will define a modal structure as a quadruple (Σ, D, W, R), where (D, W, R) is a frame,
as just defined, and Σ is an interpretation. An interpretation Σ is a function of two
arguments. The arguments are either a world in W and an individual constant, or a
member of W and a predicate (i.e., predicate constant). For each w ∈ W and each
individual constant n of QJM, Σ(n, w) ∈ D, and for each w ∈ W and each n-ary predicate
F of QJM, Σ(F, w) = a set of n-tuples of members of D. Σ(n, w) is the denotation of n at
w, and Σ(F, w) is the extension of F at w.

Furthermore, we stipulate that for each individual constant n and for all w, w′ ∈ W,
Σ(n, w) = Σ(n, w′). (So it might have been simpler to have two kinds of interpretation,
one for individual constants, which did not take worlds as arguments at all, and one for
predicates which did take worlds as arguments.) This is what it means to say that an
individual constant is a rigid designator: It denotes the same object at every world.

Note an ambiguity in the phrase "denotes at a world". What we really mean here is
denotes relative to a world. There is a possible world in which "Barack Obama" is the 
name of Sarah Palin. We could say that in that world "Barack Obama" denotes Sarah 
Palin. But that's not the kind of thing we mean here when we speak of a name as 
denoting something at a world. In English, as it is spoken in the actual world, "Barack 
Obama" denotes Barack Obama, not Sarah Palin, relative to, or "at", every world. 

Similarly, we relativize variable assignments (in structures) to worlds. A function g is a
variable assignment in a structure (Σ, D, W, R) if and only if for each variable v and
world w ∈ W, g(v, w) ∈ D. We stipulate that for all w, w′ ∈ W, g(v, w) = g(v, w′). The
variant g[v/o] of variable assignment g is the variable assignment just like g except that
for each w ∈ W, g[v/o](v, w) = o.

Term assignments are defined in terms of variable assignments and interpretations.
Where t is a term and g is some variable assignment in (Σ, D, W, R),

h(t, w) = Σ(t, w) if t is an individual constant,
h(t, w) = g(t, w) if t is a variable.

h is a term assignment for (Σ, D, W, R) and g.

Next, we define satisfaction of a formula by a variable assignment in a modal structure:

For every formula P and every structure (Σ, D, W, R) and every world w ∈ W, and every
variable assignment g in (Σ, D, W, R), g satisfies P in (Σ, D, W, R) at w if and only if:

(A) P = R(t₁, t₂, . . . , tₙ), where R is an n-ary predicate and t₁, t₂, . . . , tₙ are n terms, and
(h(t₁, w), h(t₂, w), . . . , h(tₙ, w)) ∈ Σ(R, w), or

(¬) P = ¬Q and g does not satisfy Q in (Σ, D, W, R) at w, or

(→) P = (Q → R) and either g does not satisfy Q in (Σ, D, W, R) at w or g satisfies R in
(Σ, D, W, R) at w, or

(∀) P = ∀vQ and for all o ∈ D, g[v/o] satisfies Q in (Σ, D, W, R) at w, or

(□) P = □Q and for all w′ ∈ W, if wRw′ then g satisfies Q in (Σ, D, W, R) at w′.

We say that a sentence P of QJM is true in (Σ, D, W, R) at w if and only if for every
variable assignment g in (Σ, D, W, R), g satisfies P in (Σ, D, W, R) at w.

If A is a set of sentences of QJM and Q is a sentence of QJM, then A ⊨_SQML Q if and only
if for every modal structure (Σ, D, W, R) and every w ∈ W, if every member of A is true
in (Σ, D, W, R) at w, then Q is true in (Σ, D, W, R) at w.



The Barcan and Converse Barcan Formulas: 

The problem with this shotgun wedding of quantifier domains and relativization of truth 
to worlds is that the Barcan and converse Barcan formulas turn out to be valid. (The 
validity of the Barcan formula was first discussed in the 1940's by Ruth Barcan Marcus.) 

The Barcan formula:

⊨_SQML (∀v□Q → □∀vQ)

The Barcan formula (existential variant):

⊨_SQML (◇∃vQ → ∃v◇Q)

The converse Barcan formula:

⊨_SQML (□∀vQ → ∀v□Q)

The converse Barcan formula (existential variant):

⊨_SQML (∃v◇Q → ◇∃vQ)

Proof of the Barcan Formula: Suppose ∀v□Q is true at w. Then for every variable
assignment g and every object o ∈ D, g[v/o] satisfies □Q at w. So for every variable
assignment g and every object o ∈ D, for every world w′ ∈ W such that wRw′, g[v/o]
satisfies Q at w′. So for every variable assignment g, for every w′ ∈ W such that
wRw′, for every object o ∈ D, g[v/o] satisfies Q at w′. So for every variable
assignment g, for all w′ ∈ W such that wRw′, g satisfies ∀vQ at w′. So for every
variable assignment g, g satisfies □∀vQ at w. So □∀vQ is true at w.

Proof of the Converse Barcan Formula: Similar. 



The essential point in both of these proofs is that the domain of objects that we have to
"look at" in evaluating a quantified formula is entirely independent of the domain of
worlds that we have to look at in evaluating a boxed formula. So we can reverse the
order of our metalinguistic quantifiers in formulating the satisfaction conditions of a
universally quantified boxed formula or a boxed universally quantified formula.

This does not mean that the order of any series of modal operators and quantifiers can be 
switched around. For example, we have: 

⊭_SQML (□∃vQ → ∃v□Q)

Proof: Suppose W = {w, w′}, D = {a, b}, Σ(F, w) = {(a)}, Σ(F, w′) = {(b)}. Let g be
some variable assignment in this structure. g[x/a] satisfies Fx at w. So g satisfies
∃xFx at w. g[x/b] satisfies Fx at w′. So g satisfies ∃xFx at w′. So g satisfies □∃xFx
at w. But g[x/a] does not satisfy Fx at w′; so g[x/a] does not satisfy □Fx at w. And
g[x/b] does not satisfy Fx at w; so g[x/b] does not satisfy □Fx at w. So g does not
satisfy ∃x□Fx at w. So g does not satisfy (□∃xFx → ∃x□Fx) at w. So (□∃xFx →
∃x□Fx) is not true at w.
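
The countermodel just described can be checked mechanically. The Python sketch below
assumes an encoding of my own: formulas are nested tuples, a structure is a dictionary
with keys "D", "W", "R", "ext" and "const", variable assignments are dictionaries that do
not mention worlds (which is harmless, since assignments are stipulated to be the same at
every world), and ∃ is treated as an abbreviation of ¬∀¬. All names are hypothetical.

```python
from itertools import product

def exists(v, q):
    """The existential quantifier as an abbreviation of not-all-not."""
    return ("not", ("all", v, ("not", q)))

def satisfies(g, p, w, S):
    """g satisfies p at w in the constant-domain structure S."""
    op = p[0]
    if op == "pred":                            # clause (A)
        _, name, *terms = p
        vals = tuple(S["const"][t] if t in S["const"] else g[t] for t in terms)
        return vals in S["ext"][(name, w)]
    if op == "not":
        return not satisfies(g, p[1], w, S)
    if op == "imp":
        return (not satisfies(g, p[1], w, S)) or satisfies(g, p[2], w, S)
    if op == "all":                             # quantify over the whole domain D
        return all(satisfies({**g, p[1]: o}, p[2], w, S) for o in S["D"])
    if op == "box":
        return all(satisfies(g, p[1], w2, S) for (w1, w2) in S["R"] if w1 == w)
    raise ValueError(f"not a formula: {p!r}")

W = ["w", "w'"]
S = {
    "D": {"a", "b"},
    "W": W,
    "R": {(u, v) for u in W for v in W},        # S5: every world sees every world
    "ext": {("F", "w"): {("a",)}, ("F", "w'"): {("b",)}},
    "const": {},
}
Fx = ("pred", "F", "x")

# (box exists x Fx -> exists x box Fx) fails at w in this structure.
f = ("imp", ("box", exists("x", Fx)), exists("x", ("box", Fx)))
print(satisfies({}, f, "w", S))                 # False

# By contrast, this instance of the Barcan formula holds at every world for
# every choice of extension for F on this frame.
barcan = ("imp", ("all", "x", ("box", Fx)), ("box", ("all", "x", Fx)))
subsets = [set(), {("a",)}, {("b",)}, {("a",), ("b",)}]
barcan_ok = all(
    satisfies({}, barcan, w, {**S, "ext": {("F", "w"): e1, ("F", "w'"): e2}})
    for e1, e2 in product(subsets, repeat=2)
    for w in W
)
print(barcan_ok)                                # True
```

The second check covers only one frame and one predicate, so it illustrates the validity
of the Barcan formula in SQML without proving it.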

To see that this is the right result, consider the following conditional: "If necessarily there
is some number that is the number of planets, then there is some number that is 
necessarily the number of planets." Or: "If necessarily something exists, then there is 
something that necessarily exists". 

Counterexamples to the Barcan formula: Suppose that everything that actually exists has 
(rest) mass. It is plausible that each of those things necessarily has mass, because for 
anything that has mass in fact, it just would not be recognizable as the same thing if it did 
not have mass. So everything necessarily has mass. But even in that case there
could be a possible world in which massless things exist. So it is not the case that 
necessarily everything has mass. 

Or consider the Barcan formula in its existential variant. I do not have a twin brother. 
But my having a twin brother is a possibility. That is, there is a possible world in which I 
have a twin brother. So "It is possible that I have a twin brother" is true. But it does not 
follow that there is someone now, someone out there, maybe somewhere in France, of 
whom we can say: It is possible that he is my twin brother. So, "There is someone who 
is possibly my twin brother" is false. (Admittedly, the premise sounds a little odd, at 
least if we assume that I know that I do not have a twin brother. But suppose I don't 
know this. Perhaps I suspect that I might have a twin brother because once, when I was 
very young, I heard my mother and father saying some things that I did not exactly 
understand.)

Counterexamples to the Converse Barcan Formula: In each possible world, it is true that
everything is something: ∀x∃y x = y. Since this is true at each world, we seem to have
it that the following is true at our world: □∀x∃y x = y. But now look at that couch.
That couch does not exist in every possible world, does it? I hope not! So we cannot say
that in each possible world it is something. So the following sentence is false:
∀x□∃y x = y.

Modal Free Logic 

Some people say that what has gone wrong is that in SQML we have assumed that the 
domain of objects relative to which we evaluate quantified sentences is the same for 
every world. In other words, we have assumed that for each world the objects "in" that 
world are the same as the objects "in" every other world. What I am calling Modal Free 
Logic (MFL) remedies that purported error. The term "Free Logic" actually refers to a
kind of semantics and matching proof theory in which we allow individual constants that 
do not denote anything. (The main exponents of this have been Karel Lambert and 
Ermanno Bencivenga.) We will not allow denotationless terms, but we have to modify 
Universal Elimination and Existential Introduction in ways that are similar to the way 
they are modified in Free Logic; hence the name. 

In Modal Free Logic, a structure is a quintuple (Σ, δ, D, W, R), where Σ, D, W, and R are
as before. δ is a function on members of W that yields, for each member of W, a subset
of D. So for each w ∈ W, δ(w) ⊆ D. We think of δ(w) as the set of things that "exist" at
w. We call D the outer domain (of the structure (Σ, δ, D, W, R)) and we call δ(w) the
inner domain for w (in (Σ, δ, D, W, R)). So Chris Gauker is in the inner domain for the
actual world, and Sherlock Holmes may be in the outer domain, but he is not in the inner
domain for the actual world.

(Or so they say. I myself don't understand what sense it makes to say that Sherlock 
Holmes — that very person — inhabits some possible world. "What person?", you may 
rightly ask. "There is no Sherlock Holmes!") 

The satisfaction conditions can be written the same way as for SQML, except that we
have to put "(Σ, δ, D, W, R)" in place of "(Σ, D, W, R)", and in place of (∀) we have:

(∀F) P = ∀vQ and for all o ∈ δ(w), g[v/o] satisfies Q in (Σ, δ, D, W, R) at w, or . . .

In MFL, neither the Barcan Formula nor the Converse Barcan Formula is valid. To see 
that the Barcan Formula, (∀v□Q → □∀vQ), is not valid, consider a structure in which 
there is a world w such that for all o ∈ δ(w), ⟨o⟩ ∈ Σ(F, w) and for all w', if wRw', then 
⟨o⟩ ∈ Σ(F, w'). In other words, every object in w is F in w and is F in every world 
accessible from w. ∀x□Fx will be true at w. But suppose also that in some world w' 
accessible from w there are objects o in δ(w') such that ⟨o⟩ ∉ Σ(F, w') (and so o is not 
in δ(w)). □∀xFx will be false at w. 

To see that the Converse Barcan Formula is not valid, consider a structure in which for 
every world w ∈ W, for all o ∈ δ(w), ⟨o⟩ ∈ Σ(F, w) but in which for some w ∈ W, and 
some w' ∈ W, and some o' ∈ δ(w'), ⟨o'⟩ ∉ Σ(F, w). (Recall that ⟨W, R⟩ is an S5 frame.) 
□∀xFx will be true at w', but ∀x□Fx will be false at w'. In other words, at each world 
everything that exists at that world is F, but some things that exist at w' are not F at w. 
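
Both countermodels can be checked mechanically on small finite structures. The following Python sketch is only an illustration. The two-world structures, the object names "a" and "b", and the helper functions are assumptions made for the example, and a universal accessibility relation stands in for an S5 frame. Extensions are stored as sets of objects rather than sets of 1-tuples.

```python
# Illustrative MFL structures (all names here are hypothetical).
W = ["w", "w1"]
R = {(u, v) for u in W for v in W}   # universal accessibility, so an S5 frame

def accessible(u):
    """Worlds accessible from u."""
    return [v for v in W if (u, v) in R]

def forall_box_F(u, inner, F):
    """∀x□Fx at u: every o in δ(u) is F at every world accessible from u."""
    return all(o in F[v] for o in inner[u] for v in accessible(u))

def box_forall_F(u, inner, F):
    """□∀xFx at u: at each accessible v, every o in δ(v) is F at v."""
    return all(o in F[v] for v in accessible(u) for o in inner[v])

# Against the Barcan Formula: b exists only at w1 and is not F there.
inner1 = {"w": {"a"}, "w1": {"a", "b"}}
F1 = {"w": {"a"}, "w1": {"a"}}
barcan = (forall_box_F("w", inner1, F1), box_forall_F("w", inner1, F1))

# Against the Converse Barcan Formula: b exists at w1 and is F at w1,
# but b is not F at w.
inner2 = {"w": {"a"}, "w1": {"a", "b"}}
F2 = {"w": {"a"}, "w1": {"a", "b"}}
converse = (box_forall_F("w1", inner2, F2), forall_box_F("w1", inner2, F2))

print(barcan)    # (antecedent, consequent) of the Barcan Formula at w
print(converse)  # (antecedent, consequent) of the Converse Barcan Formula at w1
```

In each pair the first component is the antecedent and the second is the consequent, so a pair of the form (True, False) is a counterexample to the conditional at the world in question.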

But while MFL has these seemingly desirable results, it has some strange results too. 
In particular, Existential Introduction is no longer valid: Q ⊭MFL ∃vQv/n. For example, 
Fa ⊭MFL ∃xFx. Suppose Σ(a, w) ∈ D, and ⟨Σ(a, w)⟩ ∈ Σ(F, w), but for all o ∈ δ(w), ⟨o⟩ ∉ 
Σ(F, w). Then Fa will be true at w, but ∃xFx will be false at w. The most we can say is 
that {Q, ∃v v = n} ⊨MFL ∃vQv/n. (Similarly, we have ∀vQ ⊭MFL Qn/v, but 
{∀vQ, ∃v v = n} ⊨MFL Qn/v.) 
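
To see the difference between the unrestricted and the restricted rule concretely, we can enumerate every one-world structure over a tiny outer domain. The sketch below is an illustration under stated assumptions: the outer domain {0, 1}, the helper `subsets`, and the variable names are all made up for the example. One world is enough because none of the sentences involved is modal. Finding no countermodel of this size is of course not a proof of validity; it only illustrates the claim.

```python
from itertools import combinations

def subsets(s):
    """All subsets of s, as a list of sets."""
    items = sorted(s)
    return [set(c) for r in range(len(items) + 1) for c in combinations(items, r)]

D = {0, 1}                 # hypothetical outer domain

plain_counter = []         # Fa true, ∃xFx false
restricted_counter = []    # Fa true, ∃x x = a true, ∃xFx false

for inner in subsets(D):            # candidate inner domain δ(w)
    for ext_F in subsets(D):        # candidate extension of F at w, drawn from D
        for a in D:                 # candidate denotation of the name a
            Fa = a in ext_F
            a_exists = a in inner                       # ∃x x = a
            some_F = any(o in ext_F for o in inner)     # ∃xFx, quantifying over δ(w)
            if Fa and not some_F:
                plain_counter.append((inner, ext_F, a))
            if Fa and a_exists and not some_F:
                restricted_counter.append((inner, ext_F, a))

print(len(plain_counter), len(restricted_counter))
```

Every structure collected in `plain_counter` is one in which the denotation of a lies outside the inner domain, which is exactly the situation described above.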

In defense of this result, it may be said that "Santa Claus lives at the North Pole" is true, 
while "There exists an x such that x lives at the North Pole" is false. I don't see it. 
"Santa Claus lives at the North Pole" is false. (That's not to say that "It is not the case 
that Santa Claus lives at the North Pole" is true. Both sentences may be neither true nor 
false. To deal with a language containing names of fictions, we may need a three-valued 
semantics.) 

Offhand, you might think that it would help if we stipulated that each extension at a 
world must be formed from members of the inner domain for that world. That is, Σ(F, w) 
must be a set of n-tuples formed from members of δ(w), not merely members of D. Then 
if Fa is true, so that ⟨Σ(a, w)⟩ ∈ Σ(F, w), we can be sure that Σ(a, w) ∈ δ(w). So 
Fa ⊨MFL ∃xFx. But that only helps when the premise is an atomic sentence (and in 
certain other cases). We still have: ¬Ga ⊭MFL ∃x¬Gx. If Σ(a, w) ∉ δ(w), then, by our 
stipulation about extensions, ¬Ga will be true at w; but ∃x¬Gx may still be false at w 
(and will be false if for all o ∈ δ(w), ⟨o⟩ ∈ Σ(G, w)). 
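
The effect of this stipulation can be illustrated by a small exhaustive search. In the sketch below, which rests on assumptions of my own choosing (outer domain {0, 1}, a single world, the helper `subsets`), extensions are drawn only from the inner domain. The search looks for structures in which an atomic premise is true while its existential generalization is false, and for structures in which a negated atomic premise is true while its existential generalization is false.

```python
from itertools import combinations

def subsets(s):
    """All subsets of s, as a list of sets."""
    items = sorted(s)
    return [set(c) for r in range(len(items) + 1) for c in combinations(items, r)]

D = {0, 1}               # hypothetical outer domain

atomic_counter = []      # Ga true, ∃xGx false
negated_counter = []     # ¬Ga true, ∃x¬Gx false

for inner in subsets(D):             # candidate inner domain δ(w)
    for ext_G in subsets(inner):     # stipulation: extension drawn from δ(w) only
        for a in D:                  # the name a may still denote anything in D
            Ga = a in ext_G
            some_G = any(o in ext_G for o in inner)          # ∃xGx
            some_not_G = any(o not in ext_G for o in inner)  # ∃x¬Gx
            if Ga and not some_G:
                atomic_counter.append((inner, ext_G, a))
            if (not Ga) and not some_not_G:
                negated_counter.append((inner, ext_G, a))

print(len(atomic_counter), len(negated_counter))
```

In every structure that ends up in `negated_counter`, the denotation of a is outside the inner domain and everything inside the inner domain is G.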

Alternatively, we might stipulate that for every name a and every world w, Σ(a, w) ∈ δ(w). 
That will ensure that Existential Introduction is valid without restriction. But that 
solution carries costs of its own. To invalidate the Barcan formula we want to allow that 
for some w, w', δ(w) ≠ δ(w'). So if we stipulate that Σ(a, w) ∈ δ(w) and Σ(a, w') ∈ δ(w'), 
then we cannot stipulate that for all w, w', Σ(a, w) = Σ(a, w'), which means that we 
cannot stipulate that names are rigid designators. The undesirable consequence of that 
is that, without further ado, we will not be able to prove the validity of □a = a. We might 
restore the necessity 
of identity by stipulating that, for each structure, the extension of "=" is the identity 
relation on the entire outer domain. But then we will have to decide whether we want 
a = a to imply ∃x x = a or not, and, if the true identities include the likes of "Santa Claus 
= Santa Claus", then neither choice is entirely unobjectionable. 

Quantified Modal Logic with an Existence Predicate 

In defense of the Barcan formula and converse Barcan formula, it might be said that the 
problem is not that they are invalid, just that they are easily misinterpreted. If we read 
the quantifiers as saying "For everything that exists" and "There exists an x such that", 
then we will think that these formulas are invalid. But we can grant that the Barcan 
formula and the converse Barcan formula are valid if we read the quantifiers as "For 
everything possible" and "There is a possible thing x such that". 

As for our counterexamples to the Barcan formula, if everything that is even possible 
necessarily has mass, then there could not be a possible world in which something does 
not have mass. There is a possible world in which I have a twin brother; and so there is 
some possible object of which we can say, "He might possibly have been my twin 
brother". 

As for our counterexample to the converse Barcan formula, on the present reading of the 
quantifiers, the antecedent, □∀x∃y x = y, means that at each possible world, each 
possible thing is a possible thing, which is trivial. But likewise the consequent, 
∀x□∃y x = y, is trivial. It just says that for each possible thing, from the point of view 
of every possible world it is a possible thing. 

But now we have a different problem. We do not want our quantifiers always to be 
understood as ranging over everything possible. For example, if I say, "No one ever lives 
to be more than 120 years old", that might be true. But it will not be true if it means "No 
possible person lives to be more than 120 years old". To address this difficulty, we can 
introduce an existence predicate E (a frontwards "e", as distinguished from the backwards 
"e" of the existential quantifiers). Then when we want to say that all actual things are F, 
without also saying that all possible things are F, we can write: ∀x(Ex → Fx). And when 
we want to say that some actual thing is F, we write ∃x(Ex ∧ Fx). 

All of our definitions of a modal structure, and satisfaction, remain just the same as in 
Simple Quantified Modal Logic. It's just that we now think of the interpretation of E as 
assigning to E at world w the subset of objects in D that actually exist in w. For example, 
we have: 

⊨SQML □∃x x = a, 

and even 

⊨SQML ∃x□ x = a, 

but 

⊭SQML ∃x□(Ex ∧ x = a) 

and 

⊭SQML ∃x(Ex ∧ □ x = a). 

This might be the best we can do within the framework of a bivalent semantics. If you 
think that neither "Pegasus flies" nor "Pegasus does not fly" should count as true, 
because Pegasus does not exist, then you will need to adopt a three-valued semantics — to 
allow that both of these sentences are neither true nor false. 
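
The four claims about the existence predicate can be illustrated on a single small structure. The sketch below is hypothetical throughout: the two worlds, the objects "o1" and "o2", and the extension chosen for E are assumptions for the example, and accessibility is taken to be universal. It shows the two valid sentences coming out true and the two invalid ones coming out false at w; it does not, of course, establish validity.

```python
# Illustrative SQML structure with an existence predicate (names are hypothetical).
W = ["w", "w1"]
D = {"o1", "o2"}                        # one domain shared by all worlds
E = {"w": {"o2"}, "w1": {"o1", "o2"}}   # extension of E at each world
a = "o1"                                # rigid denotation of the constant a

# Accessibility is universal here, so □ quantifies over all of W.

# □∃x x = a
box_exists_eq = all(any(o == a for o in D) for v in W)

# ∃x□ x = a
exists_box_eq = any(all(o == a for v in W) for o in D)

# ∃x□(Ex ∧ x = a)
exists_box_E_eq = any(all(o in E[v] and o == a for v in W) for o in D)

# ∃x(Ex ∧ □ x = a), evaluated at w
exists_E_box_eq = any(o in E["w"] and all(o == a for v in W) for o in D)

print(box_exists_eq, exists_box_eq, exists_box_E_eq, exists_E_box_eq)
```

The last two sentences fail at w because the object denoted by a is in D but not in the extension of E at w.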

Rigid designation 

We have assumed that in each structure, the assignment of objects to individual constants 
is constant across worlds. That is, for all individual constants n, for all w, w' ∈ W, 
Σ(n, w) = Σ(n, w'). 

We can have a different kind of denoting expression, called a definite description, of the 
form: (ιvG). (That's the Greek letter iota, followed by a variable, followed by a formula. 
The iota is supposed to be written upside down, but I don't think MS Word will let me do 
that.) We can think of this as a translation of "the G-thing". Definite descriptions, so 
written, form abbreviations. For any formula F, 

F(ιvG)/u =df ∃v(Ev ∧ G ∧ ∀x((Ex ∧ Gx/v) → x = v) ∧ Fv/u). 

For example, "The little green man walked in" means: "There is a v such that v is an 
existing little green man and every existing little green man is v and v walked in". 
Definite descriptions are non-rigid designators because the things they "denote" will not 
be the same in each world. 

The modal-logical difference between rigid designators and non-rigid designators is 
evident from the following: 

a = b ⊨SQML □ a = b 

(ιxG) = (ιyH) ⊭SQML □ (ιxG) = (ιyH) 

The reason for the latter is that even though the unique thing in w that is G may be the 
unique thing in w that is H, it does not follow that in every possible world w' the unique 
thing in w' that is G is the unique thing in w' that is H. 

So, for example, while it may be true that Hesperus is Phosphorus and true that 
necessarily Hesperus is Phosphorus, and true that Hesperus is the first star seen in the 
evening and true that Phosphorus is the first star seen in the morning, and true that the 
first star seen in the evening is the first star seen in the morning, it is not true that 
necessarily the first star seen in the morning is the first star seen in the evening. 
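
The contrast between names and definite descriptions can be illustrated with a two-world structure. Everything in the sketch below is an assumption made for the example: the worlds, the objects "venus" and "mars", the extensions of G and H, and the helper `the`, which picks out the unique existing thing satisfying a predicate at a world. The description identity is read with the box taking wide scope over the whole Russellian expansion.

```python
# Illustrative structure (object and world names are hypothetical).
W = ["w", "w1"]
D = {"venus", "mars"}
E = {"w": {"venus", "mars"}, "w1": {"venus", "mars"}}  # everything exists at both worlds
G = {"w": {"venus"}, "w1": {"venus"}}   # "first star seen in the evening"
H = {"w": {"venus"}, "w1": {"mars"}}    # "first star seen in the morning"
name = {"a": "venus", "b": "venus"}     # rigid: the same denotation at every world

def the(ext, world):
    """Return the unique existing member of ext at world, or None if there is none."""
    candidates = [o for o in D if o in E[world] and o in ext[world]]
    return candidates[0] if len(candidates) == 1 else None

def descriptions_identical(world):
    """(ιxG) = (ιyH) at world: a unique existing G, a unique existing H, and they coincide."""
    g, h = the(G, world), the(H, world)
    return g is not None and g == h

names_identical = name["a"] == name["b"]                # a = b
box_names = all(name["a"] == name["b"] for _ in W)      # □ a = b
desc_at_w = descriptions_identical("w")                 # (ιxG) = (ιyH) at w
box_desc = all(descriptions_identical(v) for v in W)    # □ (ιxG) = (ιyH)

print(names_identical, box_names, desc_at_w, box_desc)
```

The identity between the names is necessary because their denotations do not vary with the world, while the identity between the descriptions holds at w and fails at w1.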

Counterpart Theory 

There is at least one other idea from modal logic that frequently comes up in the 
philosophical literature, and that is the idea of counterparts. Some people, e.g., David 
Lewis, haven't liked the idea that a single individual can exist in different possible 
worlds, e.g., that I might belong to both the domain of objects in the actual world and 
belong to the domain of objects in some merely possible world. One thing they don't like 
about it is that it rules out the possibility that in some worlds there might be two people 
who have an equal claim on being me. And in some worlds there might be one person 
who has an equal claim on being you and on being me. Suppose that you have an identical twin 
sister. But in some other world, your mother did not give birth to twins; she gave birth to 
just one daughter instead. In that other world, which person is the same person as you? 
Answer: that one daughter of your mother. But equally, that one daughter of your mother 
is the same person as your twin sister. (So the relation "being the same person as" is 
not transitive.) 

To arrange things so that our formal semantics reflects this idea, we may suppose that for 
any two worlds w and w', the domain for w and the domain for w' are mutually exclusive. 
The extensions assigned to predicates at a world are formed exclusively from members of 
the domain for that world. We define intensional objects as relations whose members are 
pairs consisting of a world and an object in the domain of that world (but the relation 
need not be a function). The objects assigned to individual constants and variables are 
such intensional objects. For example, if there are just three worlds, w₁, w₂, and w₃, and 
o₁ ∈ δ(w₁), o₂ ∈ δ(w₂), and o₃ ∈ δ(w₃), then an intensional object could be 
{⟨w₁, o₁⟩, ⟨w₂, o₂⟩, ⟨w₃, o₃⟩}. 

If we write a sentence using □ or ◇ and some individual constant n, then in order to 
decide whether it is true in world w, we have to consider the intensional object assigned 
to n. For example, suppose Int is the intensional object assigned to the individual 
constant n in some structure. Then □Fn will be true at w if and only if for all o ∈ D and 
all w' ∈ W, if wRw' and ⟨w', o⟩ ∈ Int, then ⟨o⟩ ∈ Σ(F, w'). 
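
The truth condition for □Fn can be tried out on a small structure. The sketch below is an illustration only: the three worlds, the object names, the universal accessibility relation, and the function `box_F_n` are assumptions made for the example. The intensional object is deliberately a relation rather than a function, so that n has two counterparts at w2.

```python
# Illustrative counterpart-style structure (all names are hypothetical).
W = ["w1", "w2", "w3"]
R = {(u, v) for u in W for v in W}                          # universal accessibility
domain = {"w1": {"o1"}, "w2": {"o2", "p2"}, "w3": {"o3"}}   # pairwise disjoint domains
D = set().union(*domain.values())                           # all objects of all worlds

# Intensional object assigned to the constant n: pairs of a world and an
# object in that world's domain. Two pairs share the world w2.
Int = {("w1", "o1"), ("w2", "o2"), ("w2", "p2"), ("w3", "o3")}

def box_F_n(w, F):
    """□Fn at w: for all o and w', if wRw' and (w', o) is in Int, then o is F at w'."""
    return all(o in F[v]
               for o in D for v in W
               if (w, v) in R and (v, o) in Int)

F_all = {"w1": {"o1"}, "w2": {"o2", "p2"}, "w3": {"o3"}}   # every counterpart is F
F_some = {"w1": {"o1"}, "w2": {"o2"}, "w3": {"o3"}}        # p2 is not F at w2

print(box_F_n("w1", F_all))
print(box_F_n("w1", F_some))
```

With `F_some`, a single counterpart that fails to be F at an accessible world is enough to make □Fn false at w1.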



